Publication:
A CART-based genetic algorithm for constructing higher accuracy decision trees

dc.contributor.authorErsoy, Elif
dc.contributor.authorAlbey, Erinç
dc.contributor.authorKayış, Enis
dc.contributor.departmentIndustrial Engineering
dc.contributor.editorHammoudi, S.
dc.contributor.editorQuix, C.
dc.contributor.editorBernardino, J.
dc.contributor.ozuauthorALBEY, Erinç
dc.contributor.ozuauthorKAYIŞ, Enis
dc.contributor.ozugradstudentErsoy, Elif
dc.date.accessioned2021-09-16T07:19:19Z
dc.date.available2021-09-16T07:19:19Z
dc.date.issued2020
dc.description.abstractDecision trees are among the most popular classification methods due to ease of implementation and simple interpretation. In traditional methods like CART (classification and regression tree), ID4, C4.5; trees are constructed by myopic, greedy top-down induction strategy. In this strategy, the possible impact of future splits in the tree is not considered while determining each split in the tree. Therefore, the generated tree cannot be the optimal solution for the classification problem. In this paper, to improve the accuracy of the decision trees, we propose a genetic algorithm with a genuine chromosome structure. We also address the selection of the initial population by considering a blend of randomly generated solutions and solutions from traditional, greedy tree generation algorithms which is constructed for reduced problem instances. The performance of the proposed genetic algorithm is tested using different datasets, varying bounds on the depth of the resulting trees and using different initial population blends within the mentioned varieties. Results reveal that the performance of the proposed genetic algorithm is superior to that of CART in almost all datasets used in the analysis.
dc.identifier.endpage338
dc.identifier.isbn978-989758440-4
dc.identifier.scopus2-s2.0-85091960112
dc.identifier.startpage328
dc.identifier.urihttp://hdl.handle.net/10679/7554
dc.language.isoeng
dc.publicationstatusPublished
dc.publisherSciTePress
dc.relation.ispartofDATA 2020 - Proceedings of the 9th International Conference on Data Science, Technology and Applications
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywordsDecision tree
dc.subject.keywordsHeuristic
dc.subject.keywordsGenetic algorithm
dc.subject.keywordsMetaheuristic
dc.titleA CART-based genetic algorithm for constructing higher accuracy decision trees
dc.typeconferenceObject
dc.type.subtypeConference paper
dspace.entity.typePublication
relation.isOrgUnitOfPublication5dd73c02-fd2d-43e0-9a23-71bab9ae0b6b
relation.isOrgUnitOfPublication.latestForDiscovery5dd73c02-fd2d-43e0-9a23-71bab9ae0b6b

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description: