Clustering techniques
June 11, 2008 2:04 PM
Subscribe
What clustering technique to use?
I have a set of roughly 500 curves, each curve representing the numerical representation of the behavior of a transcription factor (represented by its binding motif) along a set of genomic coordinates.
In addition, I have six pre-ordained structural classifications. Each of the 500 transcription factor types is a member of one classification.
Presently, I have performed hierarchical clustering of the distances between the curves. I then color the leaves with the according classification, in order to see how the factors organize.
Is there a way to incorporate information from the structural classifications to help assist clustering? What techniques would be better suited for that?
I looked into k-means clustering, but I'm uncertain how I merge the curve information with, say, a six-dimensional unit vector (each axis being the structural classification) that represents membership to a class.
Thanks for your advice.
posted by Blazecock Pileon to science & nature (6 comments total)
2 users marked this as a favorite
posted by demiurge at 3:11 PM on June 11, 2008