Abstract
In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Combinatorial Chemistry & High Throughput Screening
Title:The Effect of Leverage and/or Influential on Structure-Activity Relationships
Volume: 16 Issue: 4
Author(s): Sorana D. Bolboaca and Lorentz Jantschi
Affiliation:
Keywords: Influential points, leverage effect, model sensitivity, model validation, quantitative structure-activity relationship (QSAR), Cook’s distance.
Abstract: In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, hi) and the influential (analysis with Cook’s distance, Di) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, hi-model and Di-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either hi or Di higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for himodels and from 1 to 13 for Di-models. Validation statistics showed that Di-models possess systematically better agreement than both full-models and hi-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook’s distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.
Export Options
About this article
Cite this article as:
Bolboaca D. Sorana and Jantschi Lorentz, The Effect of Leverage and/or Influential on Structure-Activity Relationships, Combinatorial Chemistry & High Throughput Screening 2013; 16 (4) . https://dx.doi.org/10.2174/1386207311316040003
DOI https://dx.doi.org/10.2174/1386207311316040003 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
Development and Application of Fluorescence Polarization Assays in Drug Discovery
Combinatorial Chemistry & High Throughput Screening Chemoinfectomics in Drug Design and Development
Anti-Infective Agents A Review on Mechanisms of Anti Tumor Activity of Chalcones
Anti-Cancer Agents in Medicinal Chemistry A Brief Overview on Chemistry and Biology of Benzoxepine
Letters in Drug Design & Discovery Computer Design of Vaccines: Approaches, Software Tools and Informational Resources
Current Computer-Aided Drug Design Developing Antitumor Magnetic Hyperthermia: Principles, Materials and Devices
Recent Patents on Anti-Cancer Drug Discovery COVID-19 Vaccines in Clinical Trials and their Mode of Action for Immunity against the Virus
Current Pharmaceutical Design Cytotoxic Constituents of the Vietnamese Sea Snail Monodonta labio (Linnaeus, 1758)
Letters in Organic Chemistry Synthesis of Some New Benzimidazole Derivatives with their Antioxidant Activities
Letters in Organic Chemistry Multi-Targeted Histone Deacetylase Inhibitors in Cancer Therapy
Current Medicinal Chemistry A Review on Rheumatoid Arthritis Interventions and Current Developments
Current Drug Targets Withdrawal Notice: Current scenario of COVID-19
Letters in Drug Design & Discovery The Role of Amino Acids in the Modulation of Cardiac Metabolism During Ischemia and Heart Failure
Current Pharmaceutical Design Heat Shock Protein 90 Inhibitors as Therapeutic Agents
Recent Patents on Anti-Cancer Drug Discovery Is there Any Correlation Between Binding and Functional Effects at the Translocator Protein (TSPO) (18 kDa)?
Current Molecular Medicine Management of Food-Induced Anaphylaxis: Unsolved Challenges
Current Clinical Pharmacology Ethnobotany, Pharmacological Activities and Bioavailability Studies on “King of Bitters” (Kalmegh): A Review (2010-2020)
Combinatorial Chemistry & High Throughput Screening Nitric Oxide and Disorders of the Erythrocyte: Emerging Roles and Therapeutic Targets
Cardiovascular & Hematological Disorders-Drug Targets The Role of Celecoxib as a Potential Inhibitor in the Treatment of Inflammatory Diseases - A Review
Current Medicinal Chemistry Metabolic Engineering in Isoquinoline Alkaloid Biosynthesis
Current Pharmaceutical Biotechnology