Keyword: SQL-on-Hadoop : Search

research-article

Investigating Automatic Parameter Tuning for SQL-on-Hadoop Systems

Big Data Research (BDR), Volume 25, Issue Chttps://doi.org/10.1016/j.bdr.2021.100204

Abstract

SQL-on-Hadoop engines such as Hive provide a declarative interface for processing large-scale data over computing frameworks such as Hadoop. The underlying frameworks contain a large number of configuration parameters that can ...

research-article

CirroData: Yet Another SQL-on-Hadoop Data Analytics Engine with High Performance

Journal of Computer Science and Technology (JCST), Volume 35, Issue 1Pages 194–208https://doi.org/10.1007/s11390-020-9536-z

Abstract

This paper presents CirroData, a high-performance SQL-on-Hadoop system designed for Big Data analytics workloads. As a home-grown enterprise-level online analytical processing (OLAP) system with more than seven-year research and development (R&D) ...

research-article

No data left behind: real-time insights from a complex data ecosystem

SoCC '17: Proceedings of the 2017 Symposium on Cloud ComputingPages 108–120https://doi.org/10.1145/3127479.3131208

The typical enterprise data architecture consists of several actively updated data sources (e.g., NoSQL systems, data warehouses), and a central data lake such as HDFS, in which all the data is periodically loaded through ETL processes. To simplify ...

research-article

Evaluating SQL-on-Hadoop for Big Data Warehousing on Not-So-Good Hardware

IDEAS '17: Proceedings of the 21st International Database Engineering & Applications SymposiumPages 242–252https://doi.org/10.1145/3105831.3105842

Big Data is currently conceptualized as data whose volume, variety or velocity impose significant difficulties in traditional techniques and technologies. Big Data Warehousing is emerging as a new concept for Big Data analytics. In this context, SQL-on-...

research-article

Building a Hybrid Warehouse: Efficient Joins between Data Stored in HDFS and Enterprise Warehouse

ACM Transactions on Database Systems (TODS), Volume 41, Issue 4Article No.: 21, Pages 1–38https://doi.org/10.1145/2972950

The Hadoop Distributed File System (HDFS) has become an important data repository in the enterprise as the center for all business analytics, from SQL queries and machine learning to reporting. At the same time, enterprise data warehouses (EDWs) continue ...

research-article

Adaptive Caching in Big SQL using the HDFS Cache

SoCC '16: Proceedings of the Seventh ACM Symposium on Cloud ComputingPages 321–333https://doi.org/10.1145/2987550.2987553

The memory and storage hierarchy in database systems is currently undergoing a radical evolution in the context of Big Data systems. SQL-on-Hadoop systems share data with other applications in the Big Data ecosystem by storing their data in HDFS, using ...

research-article

RBAS: A Real-Time User Behavior Analysis System for Internet TV in Cloud Computing

CFI '16: Proceedings of the 11th International Conference on Future Internet TechnologiesPages 36–42https://doi.org/10.1145/2935663.2935664

The characteristic of Internet TV user behavior is quite essential for designers to optimize resource schedule and improve user experience. With the rapid development of Internet, both Internet TV users and STB (set top boxes) models are booming. This ...

research-article

VectorH: Taking SQL-on-Hadoop to the Next Level

SIGMOD '16: Proceedings of the 2016 International Conference on Management of DataPages 1105–1117https://doi.org/10.1145/2882903.2903742

Actian Vector in Hadoop (VectorH for short) is a new SQL-on-Hadoop system built on top of the fast Vectorwise analytical database system. VectorH achieves fault tolerance and storage scalability by relying on HDFS, and extends the state-of-the-art in ...

research-article

Take me to SSD: a hybrid block-selection method on HDFS based on storage type

SAC '16: Proceedings of the 31st Annual ACM Symposium on Applied ComputingPages 965–971https://doi.org/10.1145/2851613.2851658

As the era of Big-data has risen, the importance of big data technologies is also increasing day by day. Especially, Hadoop has become a critical part of the overall Big-data system because of its ability to store, process, and analyze thousands of ...

short-paper

Flying KIWI: Design of Approximate Query Processing Engine for Interactive Data Analytics at Scale

BigDAS '15: Proceedings of the 2015 International Conference on Big Data Applications and ServicesPages 206–207https://doi.org/10.1145/2837060.2837096

This paper introduces the design of hybrid SQL-on-Hadoop system, which supports dual-mode (interactive and deep) analytics. We present an architecture of approximate query processing engine using horizontal and vertical sampling of the original database ...

Article

Database Architectures: Current State and Development

DATA 2015: Proceedings of 4th International Conference on Data Management Technologies and ApplicationsPages 152–161https://doi.org/10.5220/0005512001520161

The paper presents shortly a history and development of database management tools in last decade. The

movement towards a higher database performance and database scalability is discussed in the context to

requirements of practice. These include Big Data ...

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Investigating Automatic Parameter Tuning for SQL-on-Hadoop Systems

CirroData: Yet Another SQL-on-Hadoop Data Analytics Engine with High Performance

No data left behind: real-time insights from a complex data ecosystem

Evaluating SQL-on-Hadoop for Big Data Warehousing on Not-So-Good Hardware

Building a Hybrid Warehouse: Efficient Joins between Data Stored in HDFS and Enterprise Warehouse

Adaptive Caching in Big SQL using the HDFS Cache

RBAS: A Real-Time User Behavior Analysis System for Internet TV in Cloud Computing

VectorH: Taking SQL-on-Hadoop to the Next Level

Take me to SSD: a hybrid block-selection method on HDFS based on storage type

Flying KIWI: Design of Approximate Query Processing Engine for Interactive Data Analytics at Scale

Database Architectures: Current State and Development

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder