Publication:
DILAF: A framework for distributed analysis of large-scale system logs for anomaly detection

dc.contributor.authorAstekin, M.
dc.contributor.authorZengin, H.
dc.contributor.authorSözer, Hasan
dc.contributor.departmentComputer Science
dc.contributor.ozuauthorSÖZER, Hasan
dc.date.accessioned2020-08-18T06:15:58Z
dc.date.available2020-08-18T06:15:58Z
dc.date.issued2019-02
dc.description.abstractSystem logs constitute a rich source of information for detection and prediction of anomalies. However, they can include a huge volume of data, which is usually unstructured or semistructured. We introduce DILAF, a framework for distributed analysis of large-scale system logs for anomaly detection. DILAF is comprised of several processes to facilitate log parsing, feature extraction, and machine learning activities. It has two distinguishing features with respect to the existing tools. First, it does not require the availability of source code of the analyzed system. Second, it is designed to perform all the processes in a distributed manner to support scalable analysis in the context of large-scale distributed systems. We discuss the software architecture of DILAF and we introduce an implementation of it. We conducted controlled experiments based on two datasets to evaluate the effectiveness of the framework. In particular, we evaluated the performance and scalability attributes under various degrees of parallelism. Results showed that DILAF can maintain the same accuracy levels while achieving more than 30% performance improvement on average as the system scales, compared to baseline approaches that do not employ fully distributed processing.en_US
dc.identifier.doi10.1002/spe.2653en_US
dc.identifier.endpage170en_US
dc.identifier.issn0038-0644en_US
dc.identifier.issue2en_US
dc.identifier.scopus2-s2.0-85056763559
dc.identifier.startpage153en_US
dc.identifier.urihttp://hdl.handle.net/10679/6780
dc.identifier.urihttps://doi.org/10.1002/spe.2653
dc.identifier.volume49en_US
dc.identifier.wos000459864300002
dc.language.isoengen_US
dc.peerreviewedyesen_US
dc.publicationstatusPublisheden_US
dc.publisherWileyen_US
dc.relation.ispartofSoftware - Practice and Experience
dc.relation.publicationcategoryInternational Refereed Journal
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.subject.keywordsAnomaly detectionen_US
dc.subject.keywordsBig dataen_US
dc.subject.keywordsDistributed systemsen_US
dc.subject.keywordsLog analysisen_US
dc.subject.keywordsMachine learningen_US
dc.subject.keywordsParallel processingen_US
dc.subject.keywordsSoftware architectureen_US
dc.titleDILAF: A framework for distributed analysis of large-scale system logs for anomaly detectionen_US
dc.typeArticleen_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections