Abstract
In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Combinatorial Chemistry & High Throughput Screening
Title:The Effect of Leverage and/or Influential on Structure-Activity Relationships
Volume: 16 Issue: 4
Author(s): Sorana D. Bolboaca and Lorentz Jantschi
Affiliation:
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Abstract: In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Export Options
About this article
Cite this article as:
Bolboaca D. Sorana and Jantschi Lorentz, The Effect of Leverage and/or Influential on Structure-Activity Relationships, Combinatorial Chemistry & High Throughput Screening 2013; 16 (4) . https://dx.doi.org/10.2174/1386207311316040003
DOI https://dx.doi.org/10.2174/1386207311316040003 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
Immunotherapy in Invasive Fungal Infection - Focus on Invasive Aspergillosis
Current Pharmaceutical Design Smart Biodecorated Hybrid Nanoparticles
Current Bionanotechnology (Discontinued) Review of Current Chemoinformatic Tools for Modeling Important Aspects of CYPsmediated Drug Metabolism. Integrating Metabolism Data with Other Biological Profiles to Enhance Drug Discovery
Current Drug Metabolism In Silico Study of Chromatographic Lipophilicity Parameters of 3-(4-Substituted Benzyl)-5-Phenylhydantoins
Combinatorial Chemistry & High Throughput Screening Psychological Impact of Coronavirus (COVID-19) Disease on Cancer Patients
Coronaviruses Anti-infective and Antineoplastic Properties of Green Tea Catechins: Examining the Therapeutic Risk-benefit Ratio
Current Nutraceuticals The Role of Mass Spectrometry in the Discovery of Antibiotics and Bacterial Resistance Mechanisms: Proteomics and Metabolomics Approaches
Current Medicinal Chemistry Update on Recent Developments in Small Molecular HIV-1 RNase H Inhibitors (2013-2016): Opportunities and Challenges
Current Medicinal Chemistry The Role of Life Events and HPA Axis in Anxiety Disorders: A Review
Current Pharmaceutical Design Induced Depressive Disorder Following the First Dose of COVID-19 Vaccine
CNS & Neurological Disorders - Drug Targets The Use of Beneficial Microbial Endophytes for Plant Biomass and Stress Tolerance Improvement
Recent Patents on Biotechnology Deep Learning in the Quest for Compound Nomination for Fighting COVID-19
Current Medicinal Chemistry Molecular Recognition of Human Angiotensin-Coverting Enzyme I (hACE I) and Different Inhibitors
Current Topics in Medicinal Chemistry Soft Antibacterial Agents
Current Medicinal Chemistry Nitric Oxide in Asthma Therapy
Current Pharmaceutical Design Current Advances and Therapeutic Potential of Agents Targeting Dipeptidyl Peptidases-IV, -II, 8/9 and Fibroblast Activation Protein
Current Topics in Medicinal Chemistry Structure-Activity Relationships of Histamine H2 Receptor Ligands+
Mini-Reviews in Medicinal Chemistry Anti-Endotoxin Agents. 2. Pilot High-Throughput Screening for Novel Lipopolysaccharide-Recognizing Motifs in Small Molecules
Combinatorial Chemistry & High Throughput Screening Small Molecule p38 MAP Kinase Inhibitors for the Treatment of Inflammatory Diseases: Novel Structures and Developments During 2006- 2008
Current Topics in Medicinal Chemistry Non-Peptidic Small-Molecule Antagonists of the Human Platelet Thrombin Receptor PAR-1
Current Medicinal Chemistry - Cardiovascular & Hematological Agents