Browsing by Author "Mete, Emrah"
Now showing 1 - 1 of 1
- Results Per Page
- Sort Options
Master ThesisPublication Metadata only Modern data management strategies for machine learning tasks: A sports analytics use case on cloud platformMete, Emrah; Albey, Erinç; Albey, Erinç; Özener, Okan Örsan; Güler, M. G.; Department of Data Science; Mete, EmrahThere is no doubt that data is the most valuable asset today. The efforts of enterprises in digital transformation and creating a data-driven culture are the most concrete indicators of this. Nowadays, where data transforms all industries, it is possible to follow the rapidly developing technological developments in this field. Appropriate data management strategies are the basis of creating data-driven organizations. When the evolution of data management architectures is examined, it is possible to say that the biggest factor that triggers this evolution is the changing and increasing data sources and the velocity of data production. In addition, with the increase in the importance of business use cases that need to be done in real time, it has become a very crucial need to process data quickly and turn it into action. Today when data is strategic importance, enterprises that can manage data correctly could gain competitive advantages. Being able to the correct data management can be built with the support of up-to-date and modern approaches. The infrastructures established by the integration of new and modern methods into the platforms turn into more agile structures. This increases the number of value added services to be produced from data by providing speed and flexibility to organizations. Today, the outputs expected to be produced from data management platforms go beyond descriptive and diagnostic analytic. Now, artificial intelligence, machine learning and data science are important parts of these platforms and these are opened new channels for the future of enterprises. In this thesis, basic needs and capabilities of modern data management architectures are described and detailed explanations were made on reference architectures in the industry. Besides, data management strategies and expectations were discussed. An example prototype of the data management platforms, which is explained in detail in this thesis, has also been developed on the cloud platforms. In this prototype, the entire life cycle of the data was considered and each step was developed in detail. In addition, a data science project was developed using the data collected on the platform. Thus, an end-to-end solution has been implemented.