Angebote für die Zielgruppen:
 Alumni 
 Personal 
 

Two-Phase Clustering Strategy for Gene Expression Data Sets



AutorIn[nen]

Habich, Dirk
Wächter, Thomas
Lehner, Wolfgang
Pilarsky, Christian


In :KDML 2006 : 12. Workshop der Fachgruppe Knowledge Discovery, Data Mining und Maschinelles Lernen und des Arbeitskreises Knowledge Discovery ; (Hildesheim) : 2006.10.09-11
LWA 2006 : Lernen - Wissensentdeckung - Adaptivität (Workshop 9.11.10.2006 in Hildesheim) / Martin Schaaf, Klaus-Dieter Althoff [Hrsg.]
mehr...
 

Universität Hildesheim, Institut für Informatik, 2006 (Tagungsbeitrag)

 

 

application/pdf    Download PDF-Datei
7 p.
1074 Kb

 

Abstract/Inhalt

In the context of genome research, the method of gene expression analysis has been used for several years. Related microarray experiments are conducted all over the world, and consequently, a vast amount of microarray data sets are produced. Having access to this variety of repositories, researchers would like to incorporate this data in their analyses to increase the statistical significance of their results. In this paper, we present a new two-phase clustering strategy which is based on the combination of local clustering results to obtain a global clustering. The advantage of such a technique is that each microarray data set can be normalized and clustered separately. The set of different relevant local clustering results is then used to calculate the global clustering result. Furthermore, we present an approach based on technical as well as biological quality measures to determine weighting factors for quantifying the local results proportion within the global result. The better the attested quality of the local results, the stronger their impact on the global result.



© 2004
Impressum | Lageplan |