BIG DATA SOURCES AND DATA MINING

Anusha Prem, P. Jayanthi

Abstract


Data mining is vial in every way in every industry starting from small vendors to the large MNCs. Big data is comprises large-volume, complex, growing data sets with multiple, heterogeneous, autonomous sources. The growth of networking, data storage and data collection capacity are growing in leaps and bounds, this necessitates the need of Big data in all domains. Big data is expanding in all engineering and science domains, including physical, biological and biomedical sciences. This paper presents a HACK theorem that proposes a Big data processing model from the data mining perspective. Also it characterizes the features of Big data revolution. This model is based on data-driven character of information which also involves a demand-driven aggregation of information sources, mining and analysis, user interest modeling, and security and privacy considerations. Throughout this paper we focus on analyzing the challenging issues in the data-driven model and also in the revolution phase of Big data.


Keywords


Data mining, Big data, heterogeneous, revolution, autonomous sources.

Full Text:

PDF

References


R. Ahmed and G. Karypis, “Algorithms for Mining the Evolution of Conserved Relational States in Dynamic Networks,” Knowledge and Information Systems, Dec. 2012.

M.H. Alam, J.W. Ha, and S.K. Lee, “Novel Approaches to Crawling Important Pages Early,” Knowledge and Information Systems, Dec. 2012.

S. Aral and D. Walker, “Identifying Influential and Susceptible Members of Social Networks,”.

S. Banerjee and N. Agarwal, ” Knowledge and Information Systems.

E. Birney, “The Making of ENCODE: Lessons for Big-Data Projects,”.

J. Bollen, H. Mao, and X. Zeng, “Twitter Mood Predicts the Stock Market,” J. Computational Science, 2011.

S. Borgatti, A. Mehra, D. Brass, and G. Labianca, “Network Analysis in the Social Sciences,” Science, 2009.

J. Bughin, M. Chui, and J. Manyika, Clouds, Big Data, and Smart Assets: Ten Tech-Enabled Business Trends to Watch. McKinSey Quarterly, 2010.

R. Chen, K. Sivakumar, and H. Kargupta, “Collective Mining of Bayesian Networks from Distributed Heterogeneous Data,” Knowledge and Information Systems

D. Centola, “The Spread of Behavior in an Online Social Network Experiment,” Science, 2010.


Refbacks

  • There are currently no refbacks.

Comments on this article

View all comments




Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2018 INTERNATIONAL EDUCATION AND RESEARCH JOURNAL