Author: Kraska, Tim : Search

Applied Filters

Publication Date

People

Publications

Reproducibility Badges

132 Results for: Author: Kraska, TimEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,856,240 records)|Limit your search to The ACM Full-Text Collection (778,683 records)

Showing 1 - 20of132 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
Open Access
November 2024
Forecasting Algorithms for Intelligent Resource Scaling: An Experimental Analysis
SoCC '24: Proceedings of the 2024 ACM Symposium on Cloud ComputingPages 126–143https://doi.org/10.1145/3698038.3698564

There has been a growing demand for making modern cloud-based data analytics systems cost-effective and easy to use. AI-powered intelligent resource scaling is one such effort, aiming at automating scaling decisions for serverless offerings like Amazon ...
0
284
Metrics
Total Citations0
Total Downloads284
Last 12 Months284
Last 6 weeks108
View online with eReader
PDF
research-article
Open Access
November 2024
Vista: Machine Learning based Database Performance Troubleshooting Framework in Amazon RDS
SoCC '24: Proceedings of the 2024 ACM Symposium on Cloud ComputingPages 83–98https://doi.org/10.1145/3698038.3698519

Database performance troubleshooting is a complex multi-step process that broadly involves three key stages- (a) Detection: determining what's wrong and when; (b) Root Cause Analysis (RCA): reasoning about why is the performance poor; (c) Resolution: ...
0
166
Metrics
Total Citations0
Total Downloads166
Last 12 Months166
Last 6 weeks59
View online with eReader
PDF
research-article
August 2024
Databases Unbound: Querying All of the World's Bytes with AI
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4546–4554https://doi.org/10.14778/3685800.3685916

Over the past five decades, the relational database model has proven to be a scaleable and adaptable model for querying a variety of structured data, with use cases in analytics, transactions, graphs, streaming and more. However, most of the world's data ...
0
263
Metrics
Total Citations0
Total Downloads263
Last 12 Months263
Last 6 weeks87
Get Access
research-article
August 2024
Resource Management in Aurora Serverless
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4038–4050https://doi.org/10.14778/3685800.3685825

Amazon Aurora Serverless is an on-demand, autoscaling configuration for Amazon Aurora with full MySQL and PostgreSQL compatibility. It automatically offers capacity scale-up/down (i.e., vertical scaling) based on a customer database application's needs. ...
0
77
Metrics
Total Citations0
Total Downloads77
Last 12 Months77
Last 6 weeks21
Get Access
research-article
July 2024
Artifacts Available / v1.1
Why TPC is Not Enough: An Analysis of the Amazon Redshift Fleet
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3694–3706https://doi.org/10.14778/3681954.3682031

Database research and development is heavily influenced by benchmarks, such as the industry-standard TPC-H and TPC-DS for analytical systems. However, these twenty-year-old benchmarks neither capture how databases are deployed nor what workloads modern ...
2
150
Metrics
Total Citations2
Total Downloads150
Last 12 Months150
Last 6 weeks33
Get Access
Upcoming Conferences
Skip slideshow

EuroSys '25

March 30 - April 3, 2025

World Trade Center, Rotterdam, Netherlands

EuroSys '25 Website

CHI 2025

April 26 - May 1, 2025

Pacifico Yokohama, Yokohama, Japan

CHI 2025 Website

KDD '25

August 3 - 7, 2025

Metro Toronto Convention Centre, Toronto, ON, Canada

KDD '25 Website

CHI PLAY '25

October 13 - 16, 2025

Carnegie Mellon Univeristy, Pittsburgh, PA, USA

CHI PLAY '25 Website

CIKM '25

November 10 - 14, 2025

COEX, Seoul, Republic of Korea
research-article
July 2024
Artifacts Available / v1.1
Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3629–3643https://doi.org/10.14778/3681954.3682026

Modern organizations manage their data with a wide variety of specialized cloud database engines (e.g., Aurora, BigQuery, etc.). However, designing and managing such infrastructures is hard. Developers must consider many possible designs with non-obvious ...
0
153
Metrics
Total Citations0
Total Downloads153
Last 12 Months153
Last 6 weeks18
Get Access
short-paper
Open Access
June 2024
Mallet: SQL Dialect Translation with LLM Rule Generation
- Amadou Latyr Ngom,
- Tim Kraska
aiDM '24: Proceedings of the Seventh International Workshop on Exploiting Artificial Intelligence Techniques for Data ManagementArticle No.: 3, Pages 1–5https://doi.org/10.1145/3663742.3663973

Translating between the SQL dialects of different systems is important for migration and federated query processing. Existing approaches rely on hand-crafted translation rules, which tend to be incomplete and hard to maintain, especially as the number of ...
0
1,064
Metrics
Total Citations0
Total Downloads1,064
Last 12 Months1,064
Last 6 weeks130
View online with eReader
PDF
research-article
Open Access
June 2024
Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of DataPages 347–359https://doi.org/10.1145/3626246.3653395

Cloud data warehouses are today's standard for analytical query processing. Multiple cloud vendors offer state-of-the-art systems, such as Amazon Redshift. We have observed that customer workloads experience highly repetitive query patterns, i.e., users ...
3
575
Metrics
Total Citations3
Total Downloads575
Last 12 Months575
Last 6 weeks86
View online with eReader
PDF
research-article
June 2024
Intelligent Scaling in Amazon Redshift
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of DataPages 269–279https://doi.org/10.1145/3626246.3653394

Cloud-based data warehouses are built to be easy to use, requiring minimal intervention from customers as their workloads scale. However, there are still many dimensions of a workload that they do not scale with automatically. For example, in cloud-...
2
178
Metrics
Total Citations2
Total Downloads178
Last 12 Months178
Last 6 weeks21
Get Access
research-article
Open Access
June 2024
Stage: Query Execution Time Prediction in Amazon Redshift
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of DataPages 280–294https://doi.org/10.1145/3626246.3653391

Query performance (e.g., execution time) prediction is a critical component of modern DBMSes. As a pioneering cloud data warehouse, Amazon Redshift relies on an accurate execution time prediction for many downstream tasks, ranging from high-level ...
4
880
Metrics
Total Citations4
Total Downloads880
Last 12 Months880
Last 6 weeks141
View online with eReader
PDF
research-article
Open Access
June 2024
Automated Multidimensional Data Layouts in Amazon Redshift
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of DataPages 55–67https://doi.org/10.1145/3626246.3653379

Analytic data systems typically use data layouts to improve the performance of scanning and filtering data. Common data layout techniques include single-column sort keys, compound sort keys, and more complex multidimensional data layouts such as the Z-...
1
436
Metrics
Total Citations1
Total Downloads436
Last 12 Months436
Last 6 weeks85
View online with eReader
PDF
research-article
July 2023
Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 11Pages 3293–3301https://doi.org/10.14778/3611479.3611526

The last decade of database research has led to the prevalence of specialized systems for different workloads. Consequently, organizations often rely on a combination of specialized systems, organized in a Data Mesh. Data meshes present significant ...
6
428
Metrics
Total Citations6
Total Downloads428
Last 12 Months232
Last 6 weeks9
Get Access
article
June 2023
Technical Perspective for Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory
- Tim Kraska
ACM SIGMOD Record (SIGMOD), Volume 52, Issue 1Page 44https://doi.org/10.1145/3604437.3604447

Separation of compute and storage has become the defacto standard for cloud database systems. First proposed in 2007 for database systems [2], it is now widely adopted by all major cloud providers such as Amazon Redshift, Google BigQuery, and Snowflake. ...
0
135
Metrics
Total Citations0
Total Downloads135
Last 12 Months35
Last 6 weeks2
Get Access
research-article
Open Access
June 2023
Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift
SIGMOD '23: Companion of the 2023 International Conference on Management of DataPages 225–237https://doi.org/10.1145/3555041.3589677

There has been a lot of excitement around using machine learning to improve the performance and usability of database systems. However, few of these techniques have actually been used in the critical path of customer-facing database services. In this ...
14
2,520
Metrics
Total Citations14
Total Downloads2,520
Last 12 Months1,227
Last 6 weeks174
1
Supplementary Material
AutoWLM-SIgmod-2023.mp4
View online with eReader
PDF
research-article
Open Access
May 2023
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
FactorJoin: A New Cardinality Estimation Framework for Join Queries
Proceedings of the ACM on Management of Data (PACMMOD), Volume 1, Issue 1Article No.: 41, Pages 1–27https://doi.org/10.1145/3588721

Cardinality estimation is one of the most fundamental and challenging problems in query optimization. Neither classical nor learning-based methods yield satisfactory performance when estimating the cardinality of the join queries. They either rely on ...
21
2,171
Metrics
Total Citations21
Total Downloads2,171
Last 12 Months1,121
Last 6 weeks142
3
Supplementary Material
3588721_source_code.zip
3588721_readme.pdf
PACMMOD-V1mod041.mp4
View online with eReader
PDF
research-article
May 2023
Artifacts Available / v1.1
Extract-Transform-Load for Video Streams
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 9Pages 2302–2315https://doi.org/10.14778/3598581.3598600

Social media, self-driving cars, and traffic cameras produce video streams at large scales and cheap cost. However, storing and querying video at such scales is prohibitively expensive. We propose to treat large-scale video analytics as a data ...
7
83
Metrics
Total Citations7
Total Downloads83
Last 12 Months20
Last 6 weeks3
Get Access
research-article
March 2023
Artifacts Available / v1.1
The Case for Learned In-Memory Joins
- Ibrahim Sabek,
- Tim Kraska
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 7Pages 1749–1762https://doi.org/10.14778/3587136.3587148

In-memory join is an essential operator in any database engine. It has been extensively investigated in the database literature. In this paper, we study whether exploiting the CDF-based learned models to boost the join performance is practical. To the ...
3
91
Metrics
Total Citations3
Total Downloads91
Last 12 Months28
Last 6 weeks3
Get Access
research-article
February 2023
Artifacts Available / v1.1
Robust Query Driven Cardinality Estimation under Changing Workloads
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 6Pages 1520–1533https://doi.org/10.14778/3583140.3583164

Query driven cardinality estimation models learn from a historical log of queries. They are lightweight, having low storage requirements, fast inference and training, and are easily adaptable for any kind of query. Unfortunately, such models can suffer ...
20
504
Metrics
Total Citations20
Total Downloads504
Last 12 Months300
Last 6 weeks29
Get Access
research-article
November 2022
Artifacts Available / v1.1
Can Learned Models Replace Hash Functions?
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 3Pages 532–545https://doi.org/10.14778/3570690.3570702

Hashing is a fundamental operation in database management, playing a key role in the implementation of numerous core database data structures and algorithms. Traditional hash functions aim to mimic a function that maps a key to a random value, which can ...
2
498
Metrics
Total Citations2
Total Downloads498
Last 12 Months113
Last 6 weeks4
Get Access
proceeding
September 2022
Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2022 and DMAH 2022, Virtual Event, September 9, 2022, Revised Selected Papers
0
Metrics
Total Citations0

Search Results

Applied Filters

Publication Date

People

Authors

Institutions

Publications

Journal/Magazine Names

All Publications

Content Type

Supplemental Material Type

Paper Award

Publisher

Proceedings Series

ACM SIG Sponsors

Reproducibility Badges

Results

Caption

Forecasting Algorithms for Intelligent Resource Scaling: An Experimental Analysis

Vista: Machine Learning based Database Performance Troubleshooting Framework in Amazon RDS

Databases Unbound: Querying All of the World's Bytes with AI

Resource Management in Aurora Serverless

Why TPC is Not Enough: An Analysis of the Amazon Redshift Fleet

Upcoming Conferences

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD

Mallet: SQL Dialect Translation with LLM Rule Generation

Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses

Intelligent Scaling in Amazon Redshift

Stage: Query Execution Time Prediction in Amazon Redshift

Automated Multidimensional Data Layouts in Amazon Redshift

Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes

Technical Perspective for Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory

Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift

FactorJoin: A New Cardinality Estimation Framework for Join Queries

Extract-Transform-Load for Video Streams

The Case for Learned In-Memory Joins

Robust Query Driven Cardinality Estimation under Changing Workloads

Can Learned Models Replace Hash Functions?

Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2022 and DMAH 2022, Virtual Event, September 9, 2022, Revised Selected Papers