Abstract
Discrete classification is common in Genomic Signal Processing applications, in particular in the classification of discretized gene expression data, in discrete gene expression prediction, and in the inference of Boolean genomic regulatory networks. Once a discrete classifier is obtained from sample data, its performance must be evaluated through its classification error. In practice, the true error cannot be computed, so error estimation methods must be employed to obtain reliable estimates of the classification error from the available data. In Genomics, both classifier design and error estimation are complicated by the prevalence of small-sample data sets. This paper presents a broad review of the methodology of classification and error estimation for discrete data in the context of Genomics, focusing on performance in small-sample scenarios as well as on asymptotic behavior.
Keywords: Genomics, classification, error estimation, discrete histogram rule, sampling distribution, resubstitution, leave-one-out, ensemble methods, coefficient of determination
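To make the terms in the abstract and keywords concrete, the following is a minimal sketch of a discrete histogram rule together with its resubstitution and leave-one-out error estimates. It is an illustration under stated assumptions, not the paper's own implementation: the function names, the toy data, the tie-breaking convention (ties go to class 0), and the default label for unseen bins (class 0) are all hypothetical choices made for this example.

```python
from collections import Counter, defaultdict


def train_histogram_rule(X, y):
    """Map each observed bin (discrete feature vector) to its majority label.

    Assumption for this sketch: labels are 0/1 and ties go to class 0.
    """
    counts = defaultdict(Counter)
    for x, label in zip(X, y):
        counts[tuple(x)][label] += 1
    return {b: (1 if c[1] > c[0] else 0) for b, c in counts.items()}


def predict(rule, x, default=0):
    """Classify x; bins never seen in training get `default` (an assumption)."""
    return rule.get(tuple(x), default)


def resubstitution_error(X, y):
    """Error rate of the designed classifier on its own training sample."""
    rule = train_histogram_rule(X, y)
    errors = sum(predict(rule, x) != label for x, label in zip(X, y))
    return errors / len(y)


def leave_one_out_error(X, y):
    """Hold out each point in turn, retrain on the rest, and test on it."""
    errors = 0
    for i in range(len(y)):
        rule = train_histogram_rule(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        errors += predict(rule, X[i]) != y[i]
    return errors / len(y)


# Hypothetical toy sample: two binarized genes, eight observations.
X = [(0, 0), (0, 0), (0, 1), (1, 0), (1, 1), (1, 1), (0, 1), (1, 0)]
y = [0, 0, 0, 1, 1, 1, 1, 0]

print(resubstitution_error(X, y))  # 0.25
print(leave_one_out_error(X, y))   # 0.5
```

On this toy sample the resubstitution estimate (0.25) is lower than the leave-one-out estimate (0.5). Resubstitution reuses the design data for testing, so it tends to be optimistic on small samples, which is the kind of small-sample behavior the abstract says the review examines.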
Current Genomics
Title: Classification and Error Estimation for Discrete Data
Volume: 10 Issue: 7
Author(s): Ulisses M. Braga-Neto
Cite this article as:
Braga-Neto Ulisses M., Classification and Error Estimation for Discrete Data, Current Genomics 2009; 10(7). https://dx.doi.org/10.2174/138920209789208228
DOI: https://dx.doi.org/10.2174/138920209789208228
Print ISSN: 1389-2029
Publisher Name: Bentham Science Publisher
Online ISSN: 1875-5488