Hardi Thakor, Chandrashekhar Dubey


Social media has become a vital part of people’s lives, and as a result it generates a large amount of data that needs to be processed and analyzed. Some existing technologies were unable to store and process data at this volume, which led to the concept of big data for handling such large datasets. Mechanisms are therefore needed that classify unstructured data into an organized form, so that users can easily access the data they require. Classification techniques applied to big data make it simpler to provide users with the required data from large datasets. The Hadoop framework is used to handle this large amount of data. These techniques are adapted to classify Twitter data into different categories and to predict the class of unknown data. A number of issues and challenges need to be addressed, and these are put forward in this paper.


Big Data, Data Mining, Classification Algorithm, Hadoop, MapReduce.







This work is licensed under a Creative Commons Attribution 4.0 International License.