You are here

Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques

Download pdf | Full Screen View

Date Issued:
2011
Summary:
MicroRNAs (miRNAs) may serve as diagnostic and predictive biomarkers for cancer. The aim of this study was to identify novel cancer biomarkers from miRNA datasets, in addition to those already known. Three published miRNA cancer datasets (liver, breast, and brain) were evaluated, and the performance of the entire feature set was compared to the performance of individual feature filters, an ensemble of those filters, and a support vector machine (SVM) wrapper. In addition to confirming many known biomarkers, the main contribution of this study is that seven miRNAs have been newly identified by our ensemble methodology as possible important biomarkers for hepatocellular carcinoma or breast cancer, pending wet lab confirmation. These biomarkers were identified from miRNA expression datasets by combining multiple feature selection techniques (i.e., creating an ensemble) or by the SVM-wrapper, and then classified by different learners. Generally speaking, creating a subset of features by selecting only the highest ranking features (miRNAs) improved upon results generated when using all the miRNAs, and the ensemble and SVM-wrapper approaches outperformed individual feature selection methods. Finally, an algorithm to determine the number of top-ranked features to include in the creation of feature subsets was developed. This algorithm takes into account the performance improvement gained by adding additional features compared to the cost of adding those features.
Title: Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques.
111 views
21 downloads
Name(s): Kotlarchyk, Alex J.
College of Engineering and Computer Science
Department of Computer and Electrical Engineering and Computer Science
Type of Resource: text
Genre: Electronic Thesis Or Dissertation
Date Issued: 2011
Publisher: Florida Atlantic University
Physical Form: electronic
Extent: ix, 112 p. : ill.
Language(s): English
Summary: MicroRNAs (miRNAs) may serve as diagnostic and predictive biomarkers for cancer. The aim of this study was to identify novel cancer biomarkers from miRNA datasets, in addition to those already known. Three published miRNA cancer datasets (liver, breast, and brain) were evaluated, and the performance of the entire feature set was compared to the performance of individual feature filters, an ensemble of those filters, and a support vector machine (SVM) wrapper. In addition to confirming many known biomarkers, the main contribution of this study is that seven miRNAs have been newly identified by our ensemble methodology as possible important biomarkers for hepatocellular carcinoma or breast cancer, pending wet lab confirmation. These biomarkers were identified from miRNA expression datasets by combining multiple feature selection techniques (i.e., creating an ensemble) or by the SVM-wrapper, and then classified by different learners. Generally speaking, creating a subset of features by selecting only the highest ranking features (miRNAs) improved upon results generated when using all the miRNAs, and the ensemble and SVM-wrapper approaches outperformed individual feature selection methods. Finally, an algorithm to determine the number of top-ranked features to include in the creation of feature subsets was developed. This algorithm takes into account the performance improvement gained by adding additional features compared to the cost of adding those features.
Identifier: 773920415 (oclc), 3332260 (digitool), FADT3332260 (IID), fau:3783 (fedora)
Note(s): by Alex Kotlarchyk.
Thesis (Ph.D.)--Florida Atlantic University, 2011.
Includes bibliography.
Electronic reproduction. Boca Raton, Fla., 2011. Mode of access: World Wide Web.
Subject(s): Gene silencing
Biochemical markers
Cancer -- Diagnosis -- Data processing
Cancer -- Diagnosis -- Research
Gene expression
Tumor markers -- Diagnostic use
Persistent Link to This Record: http://purl.flvc.org/FAU/3332260
Use and Reproduction: http://rightsstatements.org/vocab/InC/1.0/
Host Institution: FAU