Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Mach: Firefighting Time-Critical Issues in Complex Systems Using High-Frequency Telemetry
- Franco Solleza,
- Shihang Li,
- William Sun,
- Richard Tang,
- Malte Schwarzkopf,
- Nesime Tatbul,
- Andrew Crotty,
- David Cohen,
- Stan Zdonik
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4425–4428https://doi.org/10.14778/3685800.3685891To understand the complex interactions in modern software, engineers often rely on high-frequency telemetry (HFT) data generated via tools like eBPF. However, today's database systems are too slow for HFT's rate and volume and cannot process HFT within ...
DeepSketch: A Query Sketching Interface for Deep Time Series Similarity Search
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4369–4372https://doi.org/10.14778/3685800.3685877By empowering domain experts to perform interactive exploration of large time series datasets, sketch-based query interfaces have revitalized interest in the well-studied problem of time series similarity search. In this new interaction paradigm, recent ...
- research-articleJuly 2022
HybriDS: Cache-Conscious Concurrent Data Structures for Near-Memory Processing Architectures
SPAA '22: Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 321–332https://doi.org/10.1145/3490148.3538591In recent years, the ever-increasing impact of memory access bottlenecks has brought forth a renewed interest in near-memory processing (NMP) architectures. In this work, we propose and empirically evaluate hybrid data structures, which are concurrent ...
- research-articleMay 2020
DeepSqueeze: Deep Semantic Compression for Tabular Data
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1733–1746https://doi.org/10.1145/3318464.3389734With the rapid proliferation of large datasets, efficient data compression has become more important than ever. Columnar compression techniques (e.g., dictionary encoding, run-length encoding, delta encoding) have proved highly effective for tabular ...
- research-articleMay 2020
DBPal: A Fully Pluggable NL2SQL Training Pipeline
- Nathaniel Weir,
- Prasetya Utama,
- Alex Galakatos,
- Andrew Crotty,
- Amir Ilkhechi,
- Shekar Ramaswamy,
- Rohin Bhushan,
- Nadja Geisler,
- Benjamin Hättasch,
- Steffen Eger,
- Ugur Cetintemel,
- Carsten Binnig
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 2347–2361https://doi.org/10.1145/3318464.3380589Natural language is a promising alternative interface to DBMSs because it enables non-technical users to formulate complex questions in a more concise manner than SQL. Recently, deep learning has gained traction for translating natural language to SQL, ...
- research-articleAugust 2017
How Progressive Visualizations Affect Exploratory Analysis
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 23, Issue 8Pages 1977–1987https://doi.org/10.1109/TVCG.2016.2607714The stated goal for visual data exploration is to operate at a rate that matches the pace of human data analysts, but the ever increasing amount of data has led to a fundamental problem: datasets are often too large to process within interactive time ...
- research-articleJune 2017
Revisiting reuse for approximate query processing
Proceedings of the VLDB Endowment (PVLDB), Volume 10, Issue 10Pages 1142–1153https://doi.org/10.14778/3115404.3115418Visual data exploration tools allow users to quickly gather insights from new datasets. As dataset sizes continue to increase, though, new techniques will be necessary to maintain the interactivity guarantees that these tools require. Approximate query ...
- abstractMay 2017
Discrete Time Specifications In Temporal Queries
CHI EA '17: Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing SystemsPages 2536–2542https://doi.org/10.1145/3027063.3053222Analysis, exploration, and visualization of time-oriented data are ubiquitous tasks in various application domains, all of which involve the execution of temporal queries. Prior research in interactively specifying the time component for such queries ...
- research-articleJune 2016
The case for interactive data exploration accelerators (IDEAs)
HILDA '16: Proceedings of the Workshop on Human-In-the-Loop Data AnalyticsArticle No.: 11, Pages 1–6https://doi.org/10.1145/2939502.2939513Enabling interactive visualization over new datasets at "human speed" is key to democratizing data science and maximizing human productivity. In this work, we first argue why existing analytics infrastructures do not support interactive data exploration ...
- research-articleMarch 2016
The end of slow networks: it's time for a redesign
Proceedings of the VLDB Endowment (PVLDB), Volume 9, Issue 7Pages 528–539https://doi.org/10.14778/2904483.2904485The next generation of high-performance networks with remote direct memory access (RDMA) capabilities requires a fundamental rethinking of the design of distributed in-memory DBMSs. These systems are commonly built under the assumption that the network ...
- research-articleAugust 2015
Vizdom: interactive analytics through pen and touch
Proceedings of the VLDB Endowment (PVLDB), Volume 8, Issue 12Pages 2024–2027https://doi.org/10.14778/2824032.2824127Machine learning (ML) and advanced statistics are important tools for drawing insights from large datasets. However, these techniques often require human intervention to steer computation towards meaningful results. In this demo, we present Vizdom, a new ...
- research-articleAugust 2015
An architecture for compiling UDF-centric workflows
Proceedings of the VLDB Endowment (PVLDB), Volume 8, Issue 12Pages 1466–1477https://doi.org/10.14778/2824032.2824045Data analytics has recently grown to include increasingly sophisticated techniques, such as machine learning and advanced statistics. Users frequently express these complex analytics tasks as workflows of user-defined functions (UDFs) that specify each ...