Authors: Koca, Melih; Arı, İsmail; Koçak, Uğur; Çalıkuş, O.; Sezgin, C.
Dates: 2016-02-11; 2016-02-11; 2012
ISBN: 978-1-4673-0055-1
URI: http://hdl.handle.net/10679/1964
DOI: https://doi.org/10.1109/SIU.2012.6204832
Access note: Due to copyright restrictions, the access to the full text of this article is only available via subscription.

Abstract: The rapid growth in mobile device and bandwidth usage is generating heavy workloads on the IT infrastructures of mobile service providers and increasing their management costs. These providers collect log files continuously and use them for billing, operational, and marketing purposes. In this paper, we describe the design, implementation, and efficient parallel processing of large-scale mobile logs on an open-source, Hadoop-based, low-cost private cloud system for near-real-time analytics. We find that batching small files, loading data in parallel, and pipelining different workloads by overlapping their disk-intensive and CPU-intensive phases can yield significant performance benefits. We applied optimizations in light of these findings. Our web-based interface helps users explore the progress and performance of their workloads.

Language: tur
Access rights: restrictedAccess
Title (Turkish): Yüksek-ölçekli mobil iletişim verilerinin açık-kaynak hadoop çerçevesi kullanılarak paralel ve iş-hatlı işlenmesi
Title (English): Parallel and pipelined processing of large-scale mobile communication data using Hadoop open-source framework
Type: conferenceObject
14
Keywords: Batch processing (computers); Cloud computing; File organisation; Invoicing; Marketing; Mobile communication; Mobile computing; Parallel processing; Pipeline processing; Public domain software; Records management; User interfaces
Scopus ID: 2-s2.0-84863459703