Generic placeholder image

International Journal of Sensors, Wireless Communications and Control

Editor-in-Chief

ISSN (Print): 2210-3279
ISSN (Online): 2210-3287

Research Article

An Improved Intelligent Approach to Enhance the Sentiment Classifier for Knowledge Discovery Using Machine Learning

Author(s): Midde Venkateswarlu Naik, D. Vasumathi* and A.P. Siva Kumar

Volume 10, Issue 4, 2020

Page: [582 - 593] Pages: 12

DOI: 10.2174/2210327910999200528114552

Price: $65

Abstract

Aims: The proposed research work is on an evolutionary enhanced method for sentiment or emotion classification on unstructured review text in the big data field. The sentiment analysis plays a vital role for current generation of people for extracting valid decision points about any aspect such as movie ratings, education institute or politics ratings, etc. The proposed hybrid approach combined the optimal feature selection using Particle Swarm Optimization (PSO) and sentiment classification through Support Vector Machine (SVM). The current approach performance is evaluated with statistical measures, such as precision, recall, sensitivity, specificity, and was compared with the existing approaches. The earlier authors have achieved an accuracy of sentiment classifier in the English text up to 94% as of now. In the proposed scheme, an average accuracy of sentiment classifier on distinguishing datasets outperformed as 99% by tuning various parameters of SVM, such as constant c value and kernel gamma value in association with PSO optimization technique. The proposed method utilized three datasets, such as airline sentiment data, weather, and global warming datasets, that are publically available. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy.

Background: The sentiment analysis plays a vital role for current generation people for extracting valid decisions about any aspect such as movie rating, education institute or even politics ratings, etc. Sentiment Analysis (SA) or opinion mining has become fascinated scientifically as a research domain for the present environment. The key area is sentiment classification on semi-structured or unstructured data in distinguish languages, which has become a major research aspect. User-Generated Content [UGC] from distinguishing sources has been hiked significantly with rapid growth in a web environment. The huge user-generated data over social media provides substantial value for discovering hidden knowledge or correlations, patterns, and trends or sentiment extraction about any specific entity. SA is a computational analysis to determine the actual opinion of an entity which is expressed in terms of text. SA is also called as computation of emotional polarity expressed over social media as natural text in miscellaneous languages. Usually, the automatic superlative sentiment classifier model depends on feature selection and classification algorithms.

Methods: The proposed work used Support vector machine as classification technique and particle swarm optimization technique as feature selection purpose. In this methodology, we tune various permutations and combination parameters in order to obtain expected desired results with kernel and without kernel technique for sentiment classification on three datasets, including airline, global warming, weather sentiment datasets, that are freely hosted for research practices.

Results: In the proposed scheme, The proposed method has outperformed with 99.2% of average accuracy to classify the sentiment on different datasets, among other machine learning techniques. The attained high accuracy in classifying sentiment or opinion about review text proves superior effectiveness over existing sentiment classifiers. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy.

Conclusion: The objective of the research issue sentiment classifier accuracy has been hiked with the help of Kernel-based Support Vector Machine (SVM) based on parameter optimization. The optimal feature selection to classify sentiment or opinion towards review documents has been determined with the help of a particle swarm optimization approach. The proposed method utilized three datasets to simulate the results, such as airline sentiment data, weather sentiment data, and global warming data that are freely available datasets.

Keywords: Sentiment analysis, knowledge discovery, machine learning, support vector machine, particle swarm optimization, global warming data.

Graphical Abstract
[1]
Montoyo A, Martinez-Barco P, Balahur A. Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments. Decis Support Syst 2012; 53: 675-9.
[http://dx.doi.org/10.1016/j.dss.2012.05.022]
[2]
Tan S, Wu Q. A Random walk algorithm for automatic construction of domain-oriented sentiment lexicon. Expert Syst Appl 2011; 38: 12094-100.
[http://dx.doi.org/10.1016/j.eswa.2011.02.105]
[3]
Heing-Thurauan T, Kevin PG, Walsh G, Dwayne DG. Electronic word-of-mouth via consumer-opinion platforms: What motivates consumers to articulate themselves on the Internet? J Interact Market 2004; 18(1): 38-52.
[http://dx.doi.org/10.1002/dir.10073]
[4]
Kumar R, Vadlamani R. A survey on opinion mining and sentiment analysis: Tasks, approaches and applications. Knowl Base Syst 2015; 89: 14-46.
[5]
Liang CY, Wang J, Robert LK, Zhang X. Refining word embeddings using intensity scores for sentiment Analysis. IEEE/ACM Trans Audio Speech Lang Process 2018; 26(3): 671-81.
[http://dx.doi.org/10.1109/TASLP.2017.2788182]
[6]
Schouten K, Van der Weijde O, Frasincar F, Dekker R. Supervised and unsupervised aspect category detection for sentiment analysis with co-occurrence data. IEEE Trans Cybern 2018; 48(4): 1263-75.
[http://dx.doi.org/10.1109/TCYB.2017.2688801 PMID: 28422676]
[7]
Liu B. Sentiment analysis: A multifaceted problem. IEEE Intell Syst 2010; 25(3): 39-80.
[http://dx.doi.org/10.1109/MIS.2012.106]
[8]
Sun Z, Fox G. Traffic flow forecasting based on combination of multidimensional scaling and SVM. Int J Intell Transport Syst Res 2014; 12(1): 20-5.
[http://dx.doi.org/10.1007/s13177-013-0065-9]
[9]
Jackand LB, Nandi AK. Fault detection using support vector machines and artificial neural networks, augmented by genetic algorithms. Mech Syst Signal Process 2002; 16(2-3): 373-90.
[http://dx.doi.org/10.1006/mssp.2001.1454]
[10]
Kennedy J, Eberhart R, Shi Y. Swarm Intelligence. San Francisco, Calif, USA: Morgan Kaufmann Publishers 2001.
[11]
Sheng CJ, Min YC, Wu YW, Anh-Duc P. Optimizing Parameters of Support Vector machine using fast messy genetic algorithm for dispute classification. Expert Syst Appl 2014; 41(8): 3955-64.
[12]
Jurek A, Maurice D, Mulvenna YB. Improved lexicon based sentiment analysis for social media analytics. Secur Inform 2015; 4(9): 2-13.
[13]
Devi NK, Jayanthi P. Sentiment classification using SVM and PSO. Int J Adv Eng Technol 2016; 7(2): 411-3.
[14]
Vishal V, Uma V. An extensive study of sentiment analysis tools and binary classification of tweets using Rapid Miner. Procedia Comput Sci 2018; 125: 329-35.
[http://dx.doi.org/10.1016/j.procs.2017.12.044]
[15]
Abd Samad HB, Burairah HI, Gede PA. Junta Zeniarja. Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization. Malaysian Technical Universities Conference on Engineering and Technology (MUCET). Procedia Eng 2013; 53: 453-62.
[16]
Jackand L.B., Nandi A.K. Fault detection using support vector machines and artificial neural networks, augmented by genetic algorithms. Mechl Syst Sign Processi. 2002; 16: 373-9.
[17]
Asriyanti IPA. On the feature selection and classification based on information gain for document sentiment analysis. Appl Comput Intell Soft Comput 2018; 2018: 1-5.
[http://dx.doi.org/10.1155/2018/1407817]
[18]
Yuan CM, Thom HT. Feature selection and parameters optimizations of SVM using particle swarm optimization for fault classification in power distribution systemsComput Intel Neurosc 2017; 1-9.4135465
[19]
Kapil S, Varun J, Ansari MD. Machine learning based support system for students to select stream. J Recent Patents Comput Sci 2019; 12: 1.
[20]
Ansari MD, Ghrera SP. Intuitionistic fuzzy local binary pattern for features extraction. Int J Inf Commun Technol 2018; 13(1): 83-98.
[http://dx.doi.org/10.1504/IJICT.2018.090435]
[21]
Abbasi A, Hsinchun C, Arab S. Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums. ACM Trans Inf Syst(TOIS) 2008; 26(3): 12.
[http://dx.doi.org/10.1145/1361684.1361685]
[22]
Brezocnik L. Feature selection for classification using particle swarm optimization. IEEE EUROCON 2017 -17th International Conference on Smart Technologies Ohrid, Macedonia 2017.
[http://dx.doi.org/10.1109/EUROCON.2017.8011255]
[23]
Fang Y, Tan H, Zhang J. Multi-strategy sentiment analysis of consumer reviews based on semantic fuzzinessIEEE Access 2018; 6: 20625-31.
[http://dx.doi.org/10.1109/ACCESS.2018.2820025]
[24]
Zhao J, Gui X, Zhang X. Deep convolution neural networks for twitter sentiment analysisIEEE Access 2018; 6: 23253-60.
[25]
Chen MH, Chen WF, Ku LW. Application of sentiment analysis to Language Learning IEEE Access 2018; 6: 24433-42.
[http://dx.doi.org/10.1109/ACCESS.2018.2832137]
[26]
Shayaa S, Jaafar NI, Bahri S, et al. Sentiment Analyis of Big Data:Methods, Applications, and Open Challeges IEEE Access 2018; 6: 37807-27.
[27]
Midde VND, Vasumathi AP, Siva K. An enhanced unsupervised learning approach for sentiment analysis using extraction of Tri- Co-occurrence Words Phrases Springer Second International Conference on Computational Intelligence and Informatics, Advances in Intelligent Systems and Computing (AISC)..
[28]
Alekh A, Bhattacharya P. Sentiment analysis: A new approach for the effective use of linguistic knowledge and exploiting similarities in a set of documents to be classified. Proceedings of the international conference Natural Language Processing (ICON).
[29]
Abd. Samad Hasan Basari, Burairah Hussin I, Gede Pramudya Ananta, Junta Zeniarja. Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization. Malaysian Technical Universities Conference on Engineering and Technology (MUCET), Information and Communication Technology, Elsevier Journal of Procedia Engineering. 2013; 53: 453-62.
[30]
Mochamad wahyudii, Dinar Ajeng Kristiyanti, Sentiment Analysis of Smart Phone Product Review using Support Vector Machine Algorithm Based Particle Swarm Optimization. Journal of Theoretical and Applied Information Technology. 2016; 91(1): 189-201.
[31]
Vapnik VN. Nature of Statistical Learning Theory. New York, NY, USA: Springer 1995.
[http://dx.doi.org/10.1007/978-1-4757-2440-0]
[32]
Hsu C, Chang C, Lin C. A practical guide to support vector Classification. Department of Computer Science National Taiwan University 2003.

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy