Abstract
Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.
Keywords: Chemoinformatics, cloud computing, malaria, text mining, virtual screening.
Combinatorial Chemistry & High Throughput Screening
Title:MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment
Volume: 18 Issue: 6
Author(s): Muthukumarasamy Karthikeyan, Yogesh Pandit, Deepak Pandit and Renu Vyas
Affiliation:
Keywords: Chemoinformatics, cloud computing, malaria, text mining, virtual screening.
Abstract: Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.
Export Options
About this article
Cite this article as:
Karthikeyan Muthukumarasamy, Pandit Yogesh, Pandit Deepak and Vyas Renu, MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment, Combinatorial Chemistry & High Throughput Screening 2015; 18 (6) . https://dx.doi.org/10.2174/1386207318666150703113525
DOI https://dx.doi.org/10.2174/1386207318666150703113525 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
Editorial:Hot Topic: [Anti-Cancer Drugs(Executive Editor: Elke Bergmann-Leitner)]
Current Pharmaceutical Design ras Genes and Human Cancer: Different Implications and Different Roles
Current Genomics Nanotechnology in Cancer Diagnostics and Therapeutics: A Review
Current Pharmaceutical Biotechnology The Role of Nitric Oxide (NO) in Stability Regulation of Hypoxia Inducible Factor-1α (HIF-1α)
Current Medicinal Chemistry New Developments in Anti-Platelet Therapies Potential Use of CD39/Vascular ATP Diphosphohydrolase in Thrombotic Disorders
Current Drug Targets From Fragment Screening to Potent Binders: Strategies for Fragment-to-Lead Evolution
Mini-Reviews in Medicinal Chemistry Fc Receptor Signaling in Leukocytes: Role in Host Defense and Immune Regulation
Current Immunology Reviews (Discontinued) The Kinase Inhibitor Imatinib - An Immunosuppressive Drug?
Current Cancer Drug Targets Adiponectin: Merely a Bystander or the Missing Link to Cardiovascular Disease?
Current Topics in Medicinal Chemistry Orai1 and Transient Receptor Potential Channels as Novel Molecular Targets to Impair Tumor Neovascularization in Renal Cell Carcinoma and other Malignancies
Anti-Cancer Agents in Medicinal Chemistry Immunomodulatory Effects of Bifidobacterium longum W11 Produced Exopolysaccharide on Cytokine Production
Current Pharmaceutical Biotechnology Utility of Measuring Serum Concentrations of Anti-TNF Agents and Anti-Drug Antibodies in Inflammatory Bowel Disease
Current Drug Metabolism Viral Reservoirs an Impediment to HAART: New Strategies to Eliminate HIV-1
Current Drug Targets - Infectious Disorders Scientific Prediction and Prophetic Patenting in Drug Discovery
Recent Patents on CNS Drug Discovery (Discontinued) A Case of Severe Transaminase Elevation Following a Single Ustekinumab Dose with Remission After Drug Withdrawal
Current Drug Safety The Role of Oxidative Stress in Anti-tumor Necrosis Factor Antibody Treatment in Crohn´s Disease
Current Medicinal Chemistry Modulation by Licofelone and Celecoxib of Experimentally Induced Cancer and Preneoplastic Lesions in Mice Exposed to Cigarette Smoke
Current Cancer Drug Targets Computer-Aided Drug Design Applied to Secondary Metabolites as Anticancer Agents
Current Topics in Medicinal Chemistry Mediterranean Diet and Longevity
Current Nutrition & Food Science Pharmacological Inhibition of Poly(ADP-ribose) Polymerase (PARP) Activity in PARP-1 Silenced Tumour Cells Increases Chemosensitivity to Temozolomide and to a N3-Adenine Selective Methylating Agent
Current Cancer Drug Targets