Abstract
In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Combinatorial Chemistry & High Throughput Screening
Title:The Effect of Leverage and/or Influential on Structure-Activity Relationships
Volume: 16 Issue: 4
Author(s): Sorana D. Bolboaca and Lorentz Jantschi
Affiliation:
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Abstract: In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Export Options
About this article
Cite this article as:
Bolboaca D. Sorana and Jantschi Lorentz, The Effect of Leverage and/or Influential on Structure-Activity Relationships, Combinatorial Chemistry & High Throughput Screening 2013; 16 (4) . https://dx.doi.org/10.2174/1386207311316040003
DOI https://dx.doi.org/10.2174/1386207311316040003 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
A Review on Structures and Functions of Bcl-2 Family Proteins from Homo sapiens
Protein & Peptide Letters Preface
Recent Patents on Engineering Synthesis and Biological Activities of New Halophenols
Medicinal Chemistry Editorial from Editor-in-Chief
Current Respiratory Medicine Reviews 2D Pharmacophoric Design and Synthesis of Novel Pyrimidine Derivatives as Anticonvulsants
Current Bioactive Compounds Antiviral Properties of Quinolone-based Drugs
Current Drug Targets - Infectious Disorders 11C-verapamil to Assess P-gp Function in Human Brain During Aging,Depression and Neurodegenerative Disease
Current Topics in Medicinal Chemistry Drug Repurposing: An Alternative Strategy to Treat COVID-19
Current Pharmaceutical Biotechnology The CRF1 Receptor, a Novel Target for the Treatment of Depression, Anxiety, and Stress-Related Disorders
CNS & Neurological Disorders - Drug Targets Melanocortin-4 Receptor Antagonists as Potential Therapeutics in the Treatment of Cachexia
Current Topics in Medicinal Chemistry COVID-19 Pandemic: How can Computer-assisted Methods help to Rein in this Global Menace?
Current Computer-Aided Drug Design Vascular Inflammation During Human Rickettsioses: An Essential Host Response and a Potential Target for Supplemental Therapy
Anti-Inflammatory & Anti-Allergy Agents in Medicinal Chemistry Gene and Stem Cell Therapy in the Treatment of Erectile Dysfunction and Pulmonary Hypertension; Potential Treatments for the Common Problem of Endothelial Dysfunction
Current Gene Therapy Meta-analysis of Lung Cancer Patients in COVID-19
Current Respiratory Medicine Reviews Forecasting of Covid-19 Cases Using Machine Learning Approach
Current Respiratory Medicine Reviews Recent Advances of α-Glucosidase Inhibitors: A Comprehensive Review
Current Topics in Medicinal Chemistry SAR and Molecular Modeling of N-Benzyl-N-hydroxy-3-(cyclopentyloxy)-4- methoxybenzene Carboxamide Analogues as Potent Phosphodiesterase-4 Inhibitors
Letters in Drug Design & Discovery Repositioning of Drugs to Counter COVID-19 Pandemic - An Insight
Current Pharmaceutical Biotechnology Mitosis-Targeting Natural Products for Cancer Prevention and Therapy
Current Drug Targets Advances in Peptide-based DNA/RNA-Intercalators
Current Protein & Peptide Science