Implementation and Analysis of Classification Algorithms for Diabetes

Author(s): Dilip Kumar Choubey*, Sanchita Paul, Smita Shandilya, Vinay Kumar Dhandhania.

Journal Name: Current Medical Imaging
Formerly: Current Medical Imaging Reviews

Volume 16 , Issue 4 , 2020

Become EABM
Become Reviewer

Graphical Abstract:


Background: In this era of cutting edge research, though one of the ubiquitous facilities accessible to modern man is state of the art medical care yet diabetes has emerged as one of the major ailments afflicting the present generation. So the prime necessity of this age has transformed into providing cheap and sustainable medical care against such major diseases like diabetes. In layman’s terms Diabetes may be defined as a physiological condition wherein the blood glucose level is more than the prescribed level on a regular basis.

Objectives: So the prime objective of this work is to provide a novel classification technique for detection of diabetes in a timely and effective manner.

Methods: The proposed work comprises of four phases: In the first phase a “Localized Diabetes Dataset” has been compiled and collected from Bombay Medical Hall, Mahabir Chowk, Pyada Toli, Upper Bazar, Jharkhand, Ranchi, India. In the second phase various classification techniques namely RBF NN, MLP NN, NBs, and J48graft DT have been applied on the Localized Diabetes Dataset. In the third phase, Genetic algorithm (GA) has been utilized as an attribute selection technique through which six attributes among twelve attributes have been filtered. Lastly in the fourth phase RBF NN, MLP NN, NBs and J48graft DT has been utilized for classification on relevant attributes obtained by GA.

Results: In this study, comparative analysis of outcomes obtained by with and without the use of GA for the same set of classification technique has been done w.r.t performance assessment. It has been conclusively inferred that GA is helpful in removing insignificant attributes, reducing the cost and computation time while enhancing ROC and accuracy.

Conclusion: The utilized strategy may likewise be executed for other medical issues.

Keywords: Localized diabetes dataset, GA, RBF NN, MLP NN, NBs, J48graft DT, PIDD, diagnosis, feature selection, diabetes, classification.

Choubey DK, Paul SGA. _RBF NN: A classification system for diabetes. IJBET 2017; 23(1): 71-93.
Choubey DK, Paul S. In: Shandilya SK, Shandilya S, Deep K, Nagar AK, Eds. Handbook of research on nature inspired soft computing and algorithms. IGI Global. 2017; pp. 359-97.
UCI Repository of Bioinformatics Databases Available from: ML Repository.html
Esin D. An intelligent diagnosis system for diabetes on linear discriminant analysis and adaptive network based fuzzy inference system: LDA-ANFIS. Digit Signal Process 2010; 20: 1248-55.
Kemal P. An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease. Digit Signal Process 2007; 17: 702-10.
Manjeevan S. A hybrid intelligent system for medical data classification. Expert Syst Appl 2014; 41: 2239-49.
Orkcu H. Comparing performances of backpropagation and genetic algorithms in the data classification. Expert Syst Appl 2011; 38: 3703-9.
Luukka P. Feature selection using fuzzy entropy measures with similarity classifier. Expert Syst Appl 2011; 38: 4600-7.
Hasan T. A comparative study on diabetes disease diagnosis using neural networks. Expert Syst Appl 2009; 36: 8610-5.
Waqar AW. Feature generation using genetic programming with comparative partner selection for diabetes classification. Expert Syst Appl 2013; 40: 5402-12.
Brito GL. Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases. IEEE Trans Syst Man Cybern C 2006; 36(2): 236-48.
Selvakuberan K, Kayathiri D, Harini B, Indra DM. An efficient feature selection method for classification in Health care Systems using machine learning techniques. In: 3rd International Conference on Electronics Computer Technology 2011 8-10 April;. Kanyakumari, India. IEEE 2011; pp. 223-6.
Quadir SA, Hossain S. Predicting heart-disease from medical data by applying naive Bayes and apriori algorithm. Int J Engine Res 2013; 4(10): 224-31.
Kayaer K, Yildirim T. Medical diagnosis on Pima Indian diabetes using general regression neural networks IEEE 2003; 2003; 181-4
Polat Kemal. A cascade learning system for classification of diabetes disease: Generalized discriminant analysis and least square support vector machine. Expert Syst Appl 2008; 34: 482-7.
Ganji MF. Using fuzzy Ant Colony Optimization for Diagnosis of Diabetes Disease Proceedings of ICEE In: 18th Iranian Conference on Electrical Engineering. 2010 May 11-13; Isfahan, Iran. IEEE 2010; pp. 501-5.
Choubey DK. Paul Sanchita. GA_J48graft DT: A hybrid intelligent system for diabetes disease diagnosis. IJBSBT 2015; 2233- 78497(5): 135-50.
Humar K. Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 2008; 35: 82-9.
Abu-Naser SS, Abu Zaiter O. An Expert system for diagnosing eye diseases using clips. JATIT 2008; 2008: 923-30.
Lee C-S, Wang MH. A fuzzy expert system for diabetes decision support application. IEEE Trans Syst Man Cybern B Cybern 2011; 41(1): 139-53.
[] [PMID: 20501347]
Karatsiolis S, Schizas CN. Region based support vector machine algorithm for medical diagnosis on pima indian diabetes dataset. In: 12th International Conference on Bioinformatics & Bioengineering (BIBE). 2012 Nov 11-13; Larnaca, Cyprus. IEEE 2013; pp. 139-4.
Ephzibah EP. Cost effective approach on feature selection using genetic algorithms and fuzzy logic for diabetes diagnosis. Int J Soft Comput 2011; 2(1): 1-10.
Kalaiselvi C, Nasira GM. A new approach for diagnosis of diabetes and prediction of cancer using ANFIS. In: World Congress on Computing and Communication Technologies. 2014 Feb 27-March 1; Trichirappalli, India. IEEE 2014; pp. 188-90.
Noman QS. Radial basis function network based on time variant multi objective particle swarm optimization for medical diseases diagnosis. Appl Soft Comput 2011; 11: 1427-38.
Gowda KA. Application of genetic algorithm optimized neural network connection weights for medical diagnosis of Pima Indians diabetes. Int J Soft Comput 2011; 2(2): 15-23.
Jayalakshmi T, Santhakumaran A. A novel classification method for diagnosis of diabetes mellitus using artificial neural networks. In: International Conference on Data Storage and Data Engineering. Feb 9-10 2010; Bangalore, India. IEEE 2010; pp 159-163.
Choubey DK. Paul Sanchita. GA_MLP NN: A Hybrid Intelligent System for Diabetes Disease Diagnosis International Journal of Intelligent Systems and Applications (IJISA). MECS 2016; 8(1): 49-59.
Barakat NH, Bradley AP, Barakat MN. Intelligible support vector machines for diagnosis of diabetes mellitus. IEEE Trans Inf Technol Biomed 2010; 14(4): 1114-20.
[] [PMID: 20071261]
Barakat NH, Bradley AP. rule extraction from support vector machines: A sequential covering approach. IEEE Trans Knowl Data Eng 2007; 19(6): 729-41.
Kumar CD, Sanchita P, Joy B. soft computing approaches for diabetes disease diagnosis: A survey. IJAER 2014; 9: 11715-26.
Barakat N. Rule extraction from support vector machines: Medical diagnosis prediction and explanation. Ph.D. thesis, School Inf Technol Electr Eng (ITEE) Brisbane, Australia 2007.
Kumar CD. Classification techniques for diagnosis of diabetes disease: A review. IJBET 2016; 21(1): 15-39.
Patil BM, Joshi RC. Association rule for classification of type-2 diabetic patients. In: Second International Conference On Machine Learning And Computing. Feb 9-11 2010; Bangalore, India. IEEE 2010; pp. 330-4.
Hemant P, Pushpavathi T. A novel approach to predict diabetes by cascading clustering and classification computing communication & networking technologies. Third In: International Conference on Computing, Communication and Networking Technologies (ICCCNT’12). July 26-28 2012; Coimbatore, India. IEEE 2012; pp. 1-7.
Daho MEH. Recognition of diabetes disease using a new hybrid learning algorithm for nefclass. In: 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA). 2013 May 12-15; Algiers, Algeria. IEEE 2013; pp. 239-43
Sathasivam S, Ong HC, Hamadneh N. Comparing neural networks: Hopfield network and RBF network. Appl Math Sci 2011; 5(69): 3439-52.
Kala R, Khanwalkar N, Vazirani H, Bhattacharya M. Evolutionary radial basis function network for classificatory problems. Int J Comput Appl 2010; 7(4): 34-49.
Kumar DC, Sanchita P, Santosh K, Shankar K. Classification of pima Indian diabetes dataset using naive bayes with genetic algorithm as an attribute selection. In: Proceedings of the International Conference on Communication and Computing System (ICCCS 2016). London: Taylor & Francis Group 2017; pp. 451-5.
Choubey DK, Sanchita P, Kanchan B, Manish K, Singh UP. In: Bhattacharyya S, Ed. Innovations in multimedia data engineering and management. In: IGI Global 2019; pp. 201-40.
Bala K, Choubey DK, Sanchita P. Soft computing and data mining techniques for thunderstorms and lightning prediction: A survey. In: International Conference of Electronics, Communication and Aerospace Technology (ICECA). ; 2017 20-22 April;. Coimbatore, India. IEEE 2017; pp. 42-46.
Bala K, Choubey DK, Sanchita P, Lala MGN. In: Sing UP, Tiwari A, Singh RK, Eds. Soft computing-based nonlinear control systems design. IGI Global 2018; pp. 1-17.
Tomar PPS, Saxena P. Architecture for medical diagnosis using rule-based technique. In: The First International Conference on Interdisciplinary Research and Development. 2011 2-3 June;. Bangkok, Thailand. IEEE 2011:; pp. 1-5.
Zeki TS. An expert system for diabetes diagnosis. American Acad Scholar Res J 2012; 4(5): 1-13.
Soundararajan K. diagnostics decision support system for tuberculosis using fuzzy logic. IJCSITS 2012; 2(3): 684-9.
Li T-S. Feature Selection for classification by using a GA-Based neural network approach. J Chin Inst Indus Eng 2006; 23(1): 55-64.
Borgohain R, Sanyal S. Rule based expert system for diagnosis of neuromuscular disorders. Int J Adv Network Appl 2012; 4(1): 1509-13.
Jia W, Zhao D, Shen T, et al. A new optimized GA-RBF neural network algorithm. Computational intelligence and neuroscience. Comput Intell Neurosci 2014; 2014: 1-6.
Tuur US, Shamsuddin SMH. Radial basis function network learning with modified back propagation algorithm. Telkomnika Indon J Elect Eng 2014; 13(2): 369-78.
Comak E, Polat K, Güneş S, Arslan A. A new medical decision making system: Least Square Support Vector Machine (LSSVM) with fuzzy weighting pre-processing. Expert Syst Appl 2007; 32: 409-14.
Choubey DK, Paul S, Dhandhenia VK. Rule based diagnosis system for diabetes. Biomed Res Allied Acad 2017; 28(12): 5196-209.
Karegowda A, Jayaram MA, Manjunath AS. Application of genetic algorithm optimized neural network connection weights for medical diagnosis of Pima Indians diabetes. Int J Soft Comput 2011; 2(2): 15-23.
Ramesh V, Padmini R. Risk level prediction system of diabetic retinopathy using classification algorithms. IJSDR 2017; 2(6): 430-5.

Rights & PermissionsPrintExport Cite as

Article Details

Year: 2020
Page: [340 - 354]
Pages: 15
DOI: 10.2174/1573405614666180828115813
Price: $65

Article Metrics

PDF: 15