Author: Milo, Tova : Search

Applied Filters

Publication Date

People

Publications

Reproducibility Badges

197 Results for: Author: Milo, TovaEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,856,240 records)|Limit your search to The ACM Full-Text Collection (778,683 records)

Showing 1 - 20of197 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
August 2024
Artifacts Available / v1.1
Demonstrating TabEE: Tabular Embedding Explanations
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4285–4288https://doi.org/10.14778/3685800.3685856

We present TabEE, Tabular Embedding Explanations, a framework designed to generate explanations for interpreting tabular embedding models. Our framework aims to furnish both local and global explanations for the original data, facilitating the detection ...
0
26
Metrics
Total Citations0
Total Downloads26
Last 12 Months26
Last 6 weeks10
Get Access
research-article
Open Access
July 2024
Automated Category Tree Construction: Hardness Bounds and Algorithms
ACM Transactions on Database Systems (TODS), Volume 49, Issue 3Article No.: 11, Pages 1–32https://doi.org/10.1145/3664283
Category trees, or taxonomies, are rooted trees where each node, called a category, corresponds to a set of related items. The construction of taxonomies has been studied in various domains, including e-commerce, document management, and question ...
0
780
Metrics
Total Citations0
Total Downloads780
Last 12 Months780
Last 6 weeks94
View online with eReader
PDF
research-article
Open Access
June 2024
Cost-Effective LLM Utilization for Machine Learning Tasks over Tabular Data
GUIDE-AI '24: Proceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AIPages 45–49https://doi.org/10.1145/3665601.3669848

Classic machine learning (ML) models excel in modeling tabular datasets but lack broader world knowledge due to the absence of pre-training, an area where Large Language Models (LLMs) stand out. This paper presents an effective method that bridges the ...
0
613
Metrics
Total Citations0
Total Downloads613
Last 12 Months613
Last 6 weeks79
View online with eReader
View this article in HTML format
PDF
short-paper
Open Access
June 2024
ASQP-RL Demo: Learning Approximation Sets for Exploratory Queries
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of DataPages 452–455https://doi.org/10.1145/3626246.3654741

We demonstrate the Approximate Selection Query Processing (ASQP-RL) system, which uses Reinforcement Learning to select a subset of a large external dataset to process locally in a notebook during data exploration. Given a query workload over an external ...
0
198
Metrics
Total Citations0
Total Downloads198
Last 12 Months198
Last 6 weeks47
View online with eReader
PDF
research-article
Open Access
March 2024
TabEE: Tabular Embeddings Explanations
Proceedings of the ACM on Management of Data (PACMMOD), Volume 2, Issue 1Article No.: 72, Pages 1–26https://doi.org/10.1145/3639329

Tabular embedding methods have become increasingly popular due to their effectiveness in improving the results of various tasks, including classic databases tasks and machine learning predictions. However, most current methods treat these embedding ...
1
1,113
Metrics
Total Citations1
Total Downloads1,113
Last 12 Months1,113
Last 6 weeks171
View online with eReader
PDF
Upcoming Conferences

KDD '25

August 3 - 7, 2025

Metro Toronto Convention Centre, Toronto, ON, Canada

KDD '25 Website

CIKM '25

November 10 - 14, 2025

COEX, Seoul, Republic of Korea
short-paper
Open Access
June 2023
ATENA-PRO: Generating Personalized Exploration Notebooks with Constrained Reinforcement Learning
SIGMOD '23: Companion of the 2023 International Conference on Management of DataPages 167–170https://doi.org/10.1145/3555041.3589727

One of the most common, helpful practices of data scientists, when starting the exploration of a given dataset, is to examine existing data exploration notebooks prepared by other data analysts or scientists. These notebooks contain curated sessions of ...
0
522
Metrics
Total Citations0
Total Downloads522
Last 12 Months173
Last 6 weeks20
1
Supplementary Material
atena-pro-demo23.mov
View online with eReader
PDF
research-article
September 2022
Artifacts Available / v1.1
FEDEX: An Explainability Framework for Data Exploration Steps
Proceedings of the VLDB Endowment (PVLDB), Volume 15, Issue 13Pages 3854–3868https://doi.org/10.14778/3565838.3565841

When exploring a new dataset, Data Scientists often apply analysis queries, look for insights in the resulting dataframe, and repeat to apply further queries. We propose in this paper a novel solution that assists data scientists in this laborious ...
4
114
Metrics
Total Citations4
Total Downloads114
Last 12 Months45
Last 6 weeks4
Get Access
research-article
August 2022
PHOcus: efficiently archiving photos
Proceedings of the VLDB Endowment (PVLDB), Volume 15, Issue 12Pages 3630–3633https://doi.org/10.14778/3554821.3554861

Our ability to collect data is rapidly outstripping our ability to effectively store and use it. Organizations are therefore facing tough decisions of what data to archive (or dispose of) to effectively meet their business goals. PHOcus addresses this ...
1
74
Metrics
Total Citations1
Total Downloads74
Last 12 Months25
Last 6 weeks2
Get Access
research-article
August 2022
OREO: detection of cherry-picked generalizations
Proceedings of the VLDB Endowment (PVLDB), Volume 15, Issue 12Pages 3570–3573https://doi.org/10.14778/3554821.3554846

Data analytics often make sense of large data sets by generalization: aggregating from the detailed data to a more general context. Given a dataset, misleading generalizations can sometimes be drawn from a cherry-picked level of aggregation to obscure ...
1
70
Metrics
Total Citations1
Total Downloads70
Last 12 Months14
Last 6 weeks0
Get Access
research-article
Open Access
July 2022
The Seattle report on database research
Communications of the ACM (CACM), Volume 65, Issue 8Pages 72–79https://doi.org/10.1145/3524284

Every five years, a group of the leading database researchers meet to reflect on their community's impact on the computing industry as well as examine current research challenges.
19
34,831
Metrics
Total Citations19
Total Downloads34,831
Last 12 Months3,139
Last 6 weeks206
More
View online with eReader
Digital Edition
View this article on the magazine site (external)
PDF
research-article
June 2022
Automated Category Tree Construction in E-Commerce
SIGMOD '22: Proceedings of the 2022 International Conference on Management of DataPages 1770–1783https://doi.org/10.1145/3514221.3526124

Category trees play a central role in many web applications, enabling browsing-style information access. Building trees that reflect users' dynamic interests is, however, a challenging task, carried out by taxonomists. This manual construction leads to ...
2
230
Metrics
Total Citations2
Total Downloads230
Last 12 Months39
Last 6 weeks3
Get Access
short-paper
June 2022
SubTab: Data Exploration with Informative Sub-Tables
SIGMOD '22: Proceedings of the 2022 International Conference on Management of DataPages 2369–2372https://doi.org/10.1145/3514221.3520154

We demonstrate SubTab, a framework for creating small, informative sub-tables of large data tables to speed up data exploration. Given a table with n rows and m columns where n and m are large, SubTab creates a sub-table T_sub with k<n rows and l<m ...
3
247
Metrics
Total Citations3
Total Downloads247
Last 12 Months28
Last 6 weeks5
Get Access
research-article
June 2022
Classifier Construction Under Budget Constraints
SIGMOD '22: Proceedings of the 2022 International Conference on Management of DataPages 1160–1174https://doi.org/10.1145/3514221.3517863

Search mechanisms over large assortments of items are central to the operation of many platforms. As users commonly express filtering conditions based on item properties that are not initially stored, companies must derive the missing information by ...
2
179
Metrics
Total Citations2
Total Downloads179
Last 12 Months24
Last 6 weeks3
Get Access
editorial
March 2022
AMW 2019 Special Issue
- Aidan Hogan,
- Tova Milo
Information Systems (ISYS), Volume 105, Issue Chttps://doi.org/10.1016/j.is.2020.101663
0
Metrics
Total Citations0
research-article
September 2021
Artifacts Available / v1.1
On detecting cherry-picked generalizations
Proceedings of the VLDB Endowment (PVLDB), Volume 15, Issue 1Pages 59–71https://doi.org/10.14778/3485450.3485457

Generalizing from detailed data to statements in a broader context is often critical for users to make sense of large data sets. Correspondingly, poorly constructed generalizations might convey misleading information even if the statements are ...
5
142
Metrics
Total Citations5
Total Downloads142
Last 12 Months44
Last 6 weeks4
Get Access
research-article
June 2021
Exploring Ratings in Subjective Databases
SIGMOD '21: Proceedings of the 2021 International Conference on Management of DataPages 62–75https://doi.org/10.1145/3448016.3457259

Subjective data links people to content items and reflects who likes or dislikes what. The valuable information this data contains is virtually infinite and satisfies various information needs. Yet, as of today, dedicated tools to explore this data are ...
4
336
Metrics
Total Citations4
Total Downloads336
Last 12 Months26
Last 6 weeks11
1
Supplementary Material
3448016.3457259.mp4
Get Access
research-article
August 2020
ExplainED: explanations for EDA notebooks
Proceedings of the VLDB Endowment (PVLDB), Volume 13, Issue 12Pages 2917–2920https://doi.org/10.14778/3415478.3415508

Exploratory Data Analysis (EDA) is an essential yet highly demanding task. To get a head start before exploring a new dataset, data scientists often prefer to view existing EDA notebooks - illustrative exploratory sessions that were created by fellow ...
7
127
Metrics
Total Citations7
Total Downloads127
Last 12 Months15
Last 6 weeks4
Get Access
research-article
August 2020
CONCIERGE: improving constrained search results by data melioration
Proceedings of the VLDB Endowment (PVLDB), Volume 13, Issue 12Pages 2865–2868https://doi.org/10.14778/3415478.3415495

The problem of finding an item-set of maximal aggregated utility that satisfies a set of constraints is at the cornerstone of many e-commerce applications. Its classical definition assumes that all the information needed to verify the constraints is ...
2
60
Metrics
Total Citations2
Total Downloads60
Last 12 Months10
Last 6 weeks0
Get Access
short-paper
May 2020
Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1527–1537https://doi.org/10.1145/3318464.3389779

Exploratory Data Analysis (EDA) is an essential yet highly demanding task. To get a head start before exploring a new dataset, data scientists often prefer to view existing EDA notebooks -- illustrative, curated exploratory sessions, on the same dataset,...
29
1,092
Metrics
Total Citations29
Total Downloads1,092
Last 12 Months122
Last 6 weeks6
1
Supplementary Material
3318464.3389779.mp4
Get Access
research-article
May 2020
Minimization of Classifier Construction Cost for Search Queries
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1351–1365https://doi.org/10.1145/3318464.3389755

Search over massive sets of items is the cornerstone of many modern applications. Users express a set of properties and expect the system to retrieve qualifying items. A common difficulty, however, is that the information on whether an item satisfies ...
3
289
Metrics
Total Citations3
Total Downloads289
Last 12 Months13
Last 6 weeks1
1
Supplementary Material
3318464.3389755.mp4
Get Access

Search Results

Applied Filters

Publication Date

People

Authors

Institutions

Publications

Journal/Magazine Names

All Publications

Content Type

Supplemental Material Type

Publisher

Proceedings Series

ACM SIG Sponsors

Reproducibility Badges

Results

Caption

Demonstrating TabEE: Tabular Embedding Explanations

Automated Category Tree Construction: Hardness Bounds and Algorithms

Cost-Effective LLM Utilization for Machine Learning Tasks over Tabular Data

ASQP-RL Demo: Learning Approximation Sets for Exploratory Queries

TabEE: Tabular Embeddings Explanations

Upcoming Conferences

ATENA-PRO: Generating Personalized Exploration Notebooks with Constrained Reinforcement Learning

FEDEX: An Explainability Framework for Data Exploration Steps

PHOcus: efficiently archiving photos

OREO: detection of cherry-picked generalizations

The Seattle report on database research

Automated Category Tree Construction in E-Commerce

SubTab: Data Exploration with Informative Sub-Tables

Classifier Construction Under Budget Constraints

AMW 2019 Special Issue

On detecting cherry-picked generalizations

Exploring Ratings in Subjective Databases

ExplainED: explanations for EDA notebooks

CONCIERGE: improving constrained search results by data melioration

Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning

Minimization of Classifier Construction Cost for Search Queries