Rodrigues, Mário; Santos, Maribel Yasmina; Bernardino, Jorge

doi:10.1002/widm.1297

Back to matches

Your institution may have access to this item. Find your institution then sign in to continue.

Title: Big data processing tools: An experimental performance evaluation.
Authors: Rodrigues, Mário; Santos, Maribel Yasmina; Bernardino, Jorge
Abstract: Big Data is currently a hot topic of research and development across several business areas mainly due to recent innovations in information and communication technologies. One of the main challenges of Big Data relates to how one should efficiently handle massive volumes of complex data. Due to the notorious complexity of the data that can be collected from multiple sources, usually motivated by increasing data volumes gathered at high velocity, efficient processing mechanisms are needed for data analysis purposes. Motivated by the rapid growth in technology, development of tools, and frameworks for Big Data, there is much discussion about Big Data querying tools and, specifically, those that are more appropriated for specific analytical needs. This paper describes and evaluates the following popular Big Data processing tools: Drill, HAWQ, Hive, Impala, Presto, and Spark. An experimental evaluation using the Transaction Processing Council (TPC‐H) benchmark is presented and discussed, highlighting the performance of each tool, according to different workloads and query types. This article is categorized under:Technologies > Computer Architectures for Data MiningFundamental Concepts of Data and Knowledge > Big Data MiningTechnologies > Data PreprocessingApplication Areas > Data Mining Software Tools Evaluating Big Data processing tools: Drill, HAWQ, Hive, Impala, Presto, and Spark using TPC‐H benchmark with 10 GB, 50 GB, and 100 GB datasets.
Subjects: BIG data; INFORMATION &; communication technologies; COMPUTATIONAL complexity; SEARCH algorithms; DATA mining
Publication: WIREs: Data Mining & Knowledge Discovery, 2019, Vol 9, Issue 2, pN.PAG
ISSN: 1942-4787
Publication type: Article
DOI: 10.1002/widm.1297

We found a match

Big data processing tools: An experimental performance evaluation.

Rodrigues, Mário; Santos, Maribel Yasmina; Bernardino, Jorge

BIG data; INFORMATION &; communication technologies; COMPUTATIONAL complexity; SEARCH algorithms; DATA mining

WIREs: Data Mining & Knowledge Discovery, 2019, Vol 9, Issue 2, pN.PAG

1942-4787

Article

10.1002/widm.1297