Publication:
A CART-based genetic algorithm for constructing higher accuracy decision trees

dc.contributor.authorErsoy, Elif
dc.contributor.authorAlbey, Erinç
dc.contributor.authorKayış, Enis
dc.contributor.departmentIndustrial Engineering
dc.contributor.editorHammoudi, S.
dc.contributor.editorQuix, C.
dc.contributor.editorBernardino, J.
dc.contributor.ozuauthorALBEY, Erinç
dc.contributor.ozuauthorKAYIŞ, Enis
dc.contributor.ozugradstudentErsoy, Elif
dc.date.accessioned2021-09-16T07:19:19Z
dc.date.available2021-09-16T07:19:19Z
dc.date.issued2020
dc.description.abstractDecision trees are among the most popular classification methods due to ease of implementation and simple interpretation. In traditional methods like CART (classification and regression tree), ID4, C4.5; trees are constructed by myopic, greedy top-down induction strategy. In this strategy, the possible impact of future splits in the tree is not considered while determining each split in the tree. Therefore, the generated tree cannot be the optimal solution for the classification problem. In this paper, to improve the accuracy of the decision trees, we propose a genetic algorithm with a genuine chromosome structure. We also address the selection of the initial population by considering a blend of randomly generated solutions and solutions from traditional, greedy tree generation algorithms which is constructed for reduced problem instances. The performance of the proposed genetic algorithm is tested using different datasets, varying bounds on the depth of the resulting trees and using different initial population blends within the mentioned varieties. Results reveal that the performance of the proposed genetic algorithm is superior to that of CART in almost all datasets used in the analysis.en_US
dc.identifier.endpage338en_US
dc.identifier.isbn978-989758440-4
dc.identifier.scopus2-s2.0-85091960112
dc.identifier.startpage328en_US
dc.identifier.urihttp://hdl.handle.net/10679/7554
dc.language.isoengen_US
dc.publicationstatusPublisheden_US
dc.publisherSciTePressen_US
dc.relation.ispartofDATA 2020 - Proceedings of the 9th International Conference on Data Science, Technology and Applications
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywordsDecision treeen_US
dc.subject.keywordsHeuristicen_US
dc.subject.keywordsGenetic algorithmen_US
dc.subject.keywordsMetaheuristicen_US
dc.titleA CART-based genetic algorithm for constructing higher accuracy decision treesen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication5dd73c02-fd2d-43e0-9a23-71bab9ae0b6b
relation.isOrgUnitOfPublication.latestForDiscovery5dd73c02-fd2d-43e0-9a23-71bab9ae0b6b

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description: