Clustering #7

trevorsummerssmith · 2012-10-13T16:11:17Z

Eric fill out some random thoughts.

epurdy · 2012-10-14T01:46:31Z

The main difficulty with clustering is figuring out an intelligible representation of a cluster. We want to be able to look at a cluster that contains maybe 25% of all the vertices, and have some idea what its "deal" is.

This basically means having some sort of domain-specific "summarization" operators.

OR: this is a weirder idea, but you could try "summarizing by sampling": you show a bunch of random examples from a given cluster. Then you can be pretty sure that the intelligible clusters will "look right" most of the time. Unintelligible clusters will probably at least look unintelligible, because the user won't be able to detect any sort of pattern from the random examples shown.

ghost assigned epurdy Oct 13, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clustering #7

Clustering #7

trevorsummerssmith commented Oct 13, 2012

epurdy commented Oct 14, 2012

Clustering #7

Clustering #7

Comments

trevorsummerssmith commented Oct 13, 2012

epurdy commented Oct 14, 2012