Truecaller is a mobile application with over 200 million unique users worldwide. Every day truecaller stores over 1 billion rows of data that they use to analyse for improving their product. The data is stored in Hadoop, which is a framework for storing and analysing large amounts of data on a distributed file system. In order to be able to analyse these large amounts of data the analytics team needs a new solution for more lightweight, ad-hoc analysis. This thesis evaluates the performance of the query engine Presto to see if it meets the requirements to help the data analytics team at truecaller gain efficiency. By using a design-science methodology, Presto’s pros and cons are presented. Presto is recommended as a solution to be used toge...
Big Data systems manage and process huge volumes of data constantly generated by various technologie...
Abstract. BigBench is the first proposal for an end to end big data analytics benchmark. It features...
This paper explores how Hadoop-based data analysis tools are developed to illustrate how they addres...
Truecaller is a mobile application with over 200 million unique users worldwide. Every day truecalle...
Truecaller is a mobile application with over 200 million unique users worldwide. Every day truecalle...
In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amount...
Big Data is currently conceptualized as data whose volume, variety or velocity impose significant d...
Due to the extensive use of SQL, the number of SQL-on-Hadoop systems has significantly increased, tr...
Apache Hadoop has provided solutions to the obstacles related to the Big Data processing. Hadoop sto...
Traditional relational database systems can not be efficiently used to analyze data with large volum...
The traditional relational database systems can not accommodate the need of analyzing data with larg...
Managed Hadoop in the cloud, especially SQL-on-Hadoop, has been gaining attention recently. On Platf...
The concept of big data has gained immense significance due to the constant growth of data sets. The...
The advent of big data has prompted both the industry and research for numerous solutions in caterin...
Big Data analytics has now become quintessential for information exploitation, given the amount and ...
Big Data systems manage and process huge volumes of data constantly generated by various technologie...
Abstract. BigBench is the first proposal for an end to end big data analytics benchmark. It features...
This paper explores how Hadoop-based data analysis tools are developed to illustrate how they addres...
Truecaller is a mobile application with over 200 million unique users worldwide. Every day truecalle...
Truecaller is a mobile application with over 200 million unique users worldwide. Every day truecalle...
In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amount...
Big Data is currently conceptualized as data whose volume, variety or velocity impose significant d...
Due to the extensive use of SQL, the number of SQL-on-Hadoop systems has significantly increased, tr...
Apache Hadoop has provided solutions to the obstacles related to the Big Data processing. Hadoop sto...
Traditional relational database systems can not be efficiently used to analyze data with large volum...
The traditional relational database systems can not accommodate the need of analyzing data with larg...
Managed Hadoop in the cloud, especially SQL-on-Hadoop, has been gaining attention recently. On Platf...
The concept of big data has gained immense significance due to the constant growth of data sets. The...
The advent of big data has prompted both the industry and research for numerous solutions in caterin...
Big Data analytics has now become quintessential for information exploitation, given the amount and ...
Big Data systems manage and process huge volumes of data constantly generated by various technologie...
Abstract. BigBench is the first proposal for an end to end big data analytics benchmark. It features...
This paper explores how Hadoop-based data analysis tools are developed to illustrate how they addres...