MaLeFICE: Machine learning support for continuous performance improvement in computational engineering

Sönmezer, Hasan Berk; Muhtaroğlu, Nitel; Arı, İsmail; Gökçin, Deniz

Publication:
MaLeFICE: Machine learning support for continuous performance improvement in computational engineering

dc.contributor.author	Sönmezer, Hasan Berk
dc.contributor.author	Muhtaroğlu, Nitel
dc.contributor.author	Arı, İsmail
dc.contributor.author	Gökçin, Deniz
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	ARI, Ismail
dc.contributor.ozugradstudent	Sönmezer, Hasan Berk
dc.contributor.ozugradstudent	Muhtaroğlu, Nitel
dc.contributor.ozugradstudent	Gökçin, Deniz
dc.date.accessioned	2022-09-07T07:00:52Z
dc.date.available	2022-09-07T07:00:52Z
dc.date.issued	2022-04-25
dc.description.abstract	Computer aided engineering (CAE) practices improved drastically within the last decade due to ease of access to computing resources and open-source software. However, increasing complexity of hardware and software settings and the scarcity of multiskilled personnel rendered the practice inefficient and infeasible again. In this article, we present a method for continuous performance improvement in computational engineering that combines online performance profiling with machine learning (ML). To test the viability of this method, we provide a detailed analysis for solution time estimation of finite element analysis (FEA) jobs based on multidimensional models. These models combine numerous matrix features (matrix size, density, bandwidth, etc.), solver features (direct-iterative, preconditioning, tolerance), and hardware features (core count, virtual–physical). We repeat our analysis over different machines as well as docker containers to demonstrate applicability over different platforms. Next, we train supervised and unsupervised ML algorithms over commonly used, realistic FEA benchmarks and compare accuracy of different models. Finally, we design two new ML-based online batch schedulers called shortest predicted time first (SPTF) and shortest cluster time first (SCTF), which are comparable in performance to the optimal, but offline shortest job first (SJF) scheduler. We find that ML-based profiling and scheduling can reduce the average turnaround times by 2x –5x over other alternatives.
dc.identifier.doi	10.1002/cpe.6674
dc.identifier.issn	1532-0626
dc.identifier.issue	9
dc.identifier.scopus	2-s2.0-85117046619
dc.identifier.uri	http://hdl.handle.net/10679/7833
dc.identifier.uri	https://doi.org/10.1002/cpe.6674
dc.identifier.volume	34
dc.identifier.wos	000707186900001
dc.language.iso	eng
dc.publicationstatus	Published
dc.publisher	Wiley
dc.relation.ispartof	Concurrency and Computation: Practice and Experience
dc.relation.publicationcategory	International Refereed Journal
dc.rights	restrictedAccess
dc.subject.keywords	Batch scheduling
dc.subject.keywords	Classification
dc.subject.keywords	Cloud
dc.subject.keywords	Clustering
dc.subject.keywords	DevOp
dc.subject.keywords	Docker
dc.subject.keywords	Finite element analysis
dc.subject.keywords	Machine learning
dc.subject.keywords	Virtualization
dc.title	MaLeFICE: Machine learning support for continuous performance improvement in computational engineering
dc.type	article
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.45 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science

Publication: MaLeFICE: Machine learning support for continuous performance improvement in computational engineering

Files

License bundle

Collections

Publication:
MaLeFICE: Machine learning support for continuous performance improvement in computational engineering