Introduction: In the recent scenario, machine learning is considered a prevailing area in the diverse fields of science and
technology include image processing, automobiles, banking, finance, etc. The availability of data and adverse
improvements over machine learning techniques have become more feasible to understand and to work on various
channels of real-time health analytics.
Method: In this paper, a health status prediction system is proposed to detect cardiovascular diseases through patients’
tweets. Further, analytics will be carried on a distributed Apache Spark framework to reduce the time taken for both
training and testing when compared with regular standalone machines.
Results: Performance of the proposed framework with Extreme Learning Machine (ELM) - Tree classifier is evaluated on
two different corpora, and which outperforms other classifiers such as Decision Trees, Naïve Bayes, and Linear SVC,
DNN, etc. in both accuracy and time.
Discussion: Social media streaming data is considered as one of the major sources for data in the proposed system. Based
on the model, the attributes of the incoming user tweets are analyzed, and accordingly cardiovascular risk is predicted.
Further, current health status is tweeted back as a reply to the respective user along with a copy to the family and
Conclusion: This proposed work provides the development of the alert-based system for heart status prediction by adding
some additional features impacting the accuracy besides reducing the response time by using Big data Apache Spark