SUMMARIZING DATA BY USING DATA MINING TECHNIQUES A COMPARATIVE BY USING C4.5 AND C5.0 ALGORITHMS
Keywords:
Machine Learning, Data mining, C4.5, C5.0, Gain ratioAbstract
This paper describes the theoretical assues of the data mining concept and their development steps. The differences between clustering and classification process are identified. The practical sides of C4.5 and C5.0 classifiers one dealt with and a comparative study is held. The results of applying both classifiers on 30 patients test serums are illustrated and compared via both classifiers. The comparison highlights the superiority of the C4.5 algorithm in presenting high resolution.
References
Adhatrao, Kalpesh and Gaykar, Aditya and Dhawan,Amiraj and Jha, Rohit and Honrao, Vipul (2013),'PREDICTING STUDENTS PERFORMANCE USING ID3
AND C4.5 CLASSIFICATION ALGORITHMS', International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.3, No.5, September 2013, PP, 2,4.
BANKERT, RICHARD L., &HADJIMICHAEL, MICHAEL & KUCIAUSKAS, ARUNAS P., & THOMPSON, WILLIAM T & RICHARDSON, KIM (2004), 'Remote
Cloud Ceiling Assessment Using data - mining Methods' ,Journal of Applied Meterology, Volume 43, P. 1935, Naval Research Laboratory Monterey, California,
USA.
Dai, Wei and Ji, Wei(2014),"A MapReduce Implementation of C4.5 Decision Tree Algorithm", International Journal of Database Theory and Application Vol.7, No.1 ,P.4.
http://dx.doi.org/10.14257/ijdta.2014.7.1.0
Gupta, Sita&Todwal, Vinod (2012), "Web Data Mining & Applications", International Journal of Engineering and Advanced Technology (IJEAT), ISSN: 2249-8958, Volume-
, Issue-3,P. 20, February 2012.
Hamilton H. and Gurak, E. and Findlater, L. and Olive, W.(2012),"Overview of Decision Trees", Tutorials on decision trees, RudjerBoskovic Institute, P.3 University of California,
California- USA, http://.machine-learning.martinsewell.com/machine-learning.pdf.
Keller ,Frank (2001), "Clustering-Connectionist and Statistical Language Processing", Computerlinguistik, Universit¨at des Saarla,P.1 Web site: www.coli.unisaarland.
de/~crocker/Teaching/Connectionist/lecture13_4up.pdf
Patil, Nilima&Lathi, Rekha&Chitre, Vidya (2012), "Comparison of C5.0 & CART Classification algorithms using pruning technique", International Journal of Engineering
Research & Technology (IJERT), Vol. 1 Issue 4, June – 2012, Mumbai-India.
Pandya, Rutvija&Pandya, Jayati (2015), " C5.0 Algorithm to Improved Decision Tree with Feature Selection and Reduced Error Pruning", International Journal of Computer
Applications (0975 – 8887), Volume 117 – No. 16, P.2.
Sewell, Martin (2007) "Machine Learning", Department of Computer Science, P.1 University College London, Web Site:www.broadinstitute.org/annotation/wintercourse_
Shuweihdi, Farag (2009)," Clustering and Classification with Shape Examples" Ph.Dtheseis, University of Leeds- Department of Statistics, P.2, UK. Web site:
http://www1.maths.leeds.ac.uk/statistics.
Suknovic, Milija&Delibasic, Boris &Jovanovic, Milos&Vukicevic, Milan & Becejski-Vujaklija, Dragana, &Obradovic, Zoran (2012), "Reusable components in
decision tree induction algorithms", Comput Stat (2012) 27:127–148, Springer-Verlag 2011.
Zheng, Yujie (2012), " Clustering Methods in Data Mining with its Applications in High Education", International Conference on Education Technology and Computer
(ICETC2012), IPCSIT vol.43 (2012) © (2012) IACSIT Press, Singapore.
Books:
Nisbet, & Elder, John & Miner, Gary(2009) "Handbook of Statistical Analysis & Data Mining Applications", Academic Press/Elsevier, ISBN 9780123747655, P.12.
Adrian P. &Zantinge D.,(2010), "Data Mining",P.1, www.slideshare.net.
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 International Education and Research Journal (IERJ)

This work is licensed under a Creative Commons Attribution 4.0 International License.