Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

PODS '11: Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

June 2011

2011 Proceeding

General Chair:
Maurizio Lenzerini
Sapienza University of Rome, Italy
,
Program Chair:
Thomas Schwentick
TU Dortmund University, Germany

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

SIGMOD/PODS '11: International Conference on Management of Data Athens Greece June 13 - 15, 2011

ISBN:

978-1-4503-0660-7

Published:

13 June 2011

Sponsors:

SIGMOD

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Abstract

It is our great pleasure to welcome you to the 2011 ACM Symposium on Principles of Database Systems -- PODS'11. This year's symposium continues its tradition of being the premier international conference on the theoretical aspects of data management. PODS papers are distinguished by a rigorous approach to widely diverse problems in databases, often bringing to bear techniques from a variety of different areas, including computational logic, finite model theory, computational complexity, algorithm design and analysis, programming languages, and artificial intelligence. The first PODS conference was held in Los Angeles (CA) in 1982, with Jeffrey D. Ullman as General Chair. Since that time, virtually all new ideas, methods and techniques for data management have been investigated and presented in subsequent PODS conferences (see http://www09.sigmod.org/sigmod/pods/ for various information on the conference series).

To celebrate the 30th anniversary of the Symposium, the PODS Executive Committee has organized the PODS 30th Anniversary Colloquium, a special event held in June 12, 2011, with the goal of providing a retrospective on the role of database theory, and outlining a picture of the future directions of the discipline. The Colloquium featured five invited presentations from distinguished leaders in the field, namely:

Moshe Y. Vardi: "The rise, fall, and rise of dependency theory: Part 1, the rise and fall", Ronald Fagin: "The rise, fall, and rise of dependency theory: Part 2, the rise from the ashes", Jeffrey D. Ullman: "Deductive Databases",

Serge Abiteboul: "Trees, semistructured data, and other strange ways to go beyond tables", Victor Vianu: "Database Theory: Back to the Future".

The PODS Executive Committee is grateful to the speakers for their participation in the event, and to Frank Neven for organizing a lively discussion session ending the Colloquium.

This volume contains the proceedings of the Thirtieth ACMSIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2011), held in Athens, Greece, on June 13-15, 2011, in conjunction with the 2011 ACM SIGMOD International Conference on Management of Data.

The proceedings include a paper by Daniel Deutch and Tova Milo based on the keynote address by Tova Milo and two papers, the first by Marcelo Arenas and Jorge Pérez, based on the tutorial byMarcelo Arenas, and the second based on the tutorial by S. Muthu Muthukrishnan, and 25 contributed papers that were selected by the Program Committee from 113 submissions. Most of these papers are preliminary reports on work in progress. While they have been read by program committee members, they have not been formally refereed. Many of them will probably appear in more polished and detailed form in scientific journals.

The program committee selected the paper Data Exchange beyond Complete Data by Marcelo Arenas, Jorge Pérez and Juan L. Reutter for the PODS 2011 Best Paper Award. In addition, the announcement of the 2011 ACM PODS Alberto O. Mendelzon Test-of-Time Award appears in the proceedings. This year, the award is given to Optimal Aggregation Algorithms for Middleware by Ronald Fagin, Amnon Lotem, and Moni Naor. The paper originally appeared in the proceedings of PODS 2001. Warmest congratulations to the authors of these papers.

Proceeding Downloads

PDF(title page, copyright, general chair's welcome message, program chair's message, contents, organization, sponsors)

PDF(author index)

ZIP

1989284.bib

Select All

Export Citations Save to Binder

invited-talk

PODS 30th Anniversary Colloquium

https://doi.org/10.1145/1989284.2220036

SESSION: Keynote address

keynote

A quest for beauty and wealth (or, business processes for database researchers)

Pages 1–12https://doi.org/10.1145/1989284.1989286

While classic data management focuses on the data itself, research on Business Processes considers also the context in which this data is generated and manipulated, namely the processes, the users, and the goals that this data serves. This allows the ...

SESSION: Streaming & sampling

research-article

Get the most out of your sample: optimal unbiased estimators using partial information

Pages 13–24https://doi.org/10.1145/1989284.1989288

Random sampling is an essential tool in the processing and transmission of data. It is used to summarize data too large to store or manipulate and meet resource constraints on bandwidth or battery power. Estimators that are applied to the sample ...

research-article

Tight bounds for Lp samplers, finding duplicates in streams, and related problems

Pages 49–58https://doi.org/10.1145/1989284.1989289

In this paper, we present near-optimal space bounds for L_p-samplers. Given a stream of updates (additions and subtraction) to the coordinates of an underlying vector x in Rⁿ, a perfect L_p sampler outputs the i-th coordinate with probability x_i^pxpp. In ...

research-article

Pan-private algorithms via statistics on sketches

Pages 37–48https://doi.org/10.1145/1989284.1989290

Consider fully dynamic data, where we track data as it gets inserted and deleted. There are well developed notions of private data analyses with dynamic data, for example, using differential privacy. We want to go beyond privacy, and consider privacy ...

research-article

FIFO indexes for decomposable problems

Pages 25–35https://doi.org/10.1145/1989284.1989291

This paper studies first-in-first-out (FIFO) indexes, each of which manages a dataset where objects are deleted in the same order as their insertions. We give a technique that converts a static data structure to a FIFO index for all decomposable ...

SESSION: Incomplete information & awards

research-article

Data exchange beyond complete data

Pages 83–94https://doi.org/10.1145/1989284.1989293

In the traditional data exchange setting, source instances are restricted to be complete in the sense that every fact is either true or false in these instances. Although natural for a typical database translation scenario, this restriction is gradually ...

research-article

Incomplete information and certain answers in general data models

Leonid Libkin

Pages 59–70https://doi.org/10.1145/1989284.1989294

While incomplete information is ubiquitous in all data models - especially in applications involving data translation or integration - our understanding of it is still not completely satisfactory. For example, even such a basic notion as certain answers ...

research-article

Determining the currency of data

Pages 71–82https://doi.org/10.1145/1989284.1989295

Data in real-life databases become obsolete rapidly. One often finds that multiple values of the same entity reside in a database. While all of these values were once correct, most of them may have become stale and inaccurate. Worse still, the values ...

SESSION: Index structures & external memory

research-article

New results on two-dimensional orthogonal range aggregation in external memory

Pages 129–139https://doi.org/10.1145/1989284.1989297

We consider the orthogonal range aggregation problem. The dataset S consists of N axis-parallel rectangles in R², each of which is associated with an integer weight. Given an axis-parallel rectangle Q and an aggregate function F, a query reports the ...

research-article

On finding skylines in external memory

Pages 107–116https://doi.org/10.1145/1989284.1989298

We consider the skyline problem (a.k.a. the maxima problem), which has been extensively studied in the database community. The input is a set P of d-dimensional points. A point dominates another if the former has a lower coordinate than the latter on ...

research-article

Beyond simple aggregates: indexing for summary queries

Pages 117–128https://doi.org/10.1145/1989284.1989299

Database queries can be broadly classified into two categories: reporting queries and aggregation queries. The former retrieves a collection of records from the database that match the query's conditions, while the latter returns an aggregate, such as ...

research-article

Space-efficient substring occurrence estimation

Pages 95–106https://doi.org/10.1145/1989284.1989300

We study the problem of estimating the number of occurrences of substrings in textual data: A text T on some alphabet £ of size Ã is preprocessed and an index I is built. The index is used in lieu of the text to answer queries of the form CountH(P), ...

SESSION: Provenance

research-article

Provenance for aggregate queries

Pages 153–164https://doi.org/10.1145/1989284.1989302

We study in this paper provenance information for queries with aggregation. Provenance information was studied in the context of various query languages that do not allow for aggregation, and recent work has suggested to capture provenance by annotating ...

research-article

On provenance minimization

Pages 141–152https://doi.org/10.1145/1989284.1989303

Provenance information has been proved to be very effective in capturing the computational process performed by queries, and has been used extensively as the input to many advanced data management tools (e.g. view maintenance, trust assessment, or query ...

research-article

On the complexity of privacy-preserving complex event processing

Pages 165–174https://doi.org/10.1145/1989284.1989304

Complex Event Processing (CEP) Systems are stream processing systems that monitor incoming event streams in search of userspecified event patterns. While CEP systems have been adopted in a variety of applications, the privacy implications of event ...

research-article

Provenance views for module privacy

Pages 175–186https://doi.org/10.1145/1989284.1989305

Scientific workflow systems increasingly store provenance information about the module executions used to produce a data item, as well as the parameter settings and intermediate data items passed between module executions. However, authors/owners of ...

SESSION: Queries & views

research-article

Querying graph patterns

Pages 199–210https://doi.org/10.1145/1989284.1989307

Graph data appears in a variety of application domains, and many uses of it, such as querying, matching, and transforming data, naturally result in incompletely specified graph data, i.e., graph patterns. While queries need to be posed against such data,...

research-article

Maximizing conjunctive views in deletion propagation

Pages 187–198https://doi.org/10.1145/1989284.1989308

In deletion propagation, tuples from the database are deleted in order to reflect the deletion of a tuple from the view. Such an operation may result in the (often necessary) deletion of additional tuples from the view, besides the intentionally deleted ...

research-article

Determining relevance of accesses at runtime

Pages 211–222https://doi.org/10.1145/1989284.1989309

Consider the situation where a query is to be answered using Web sources that restrict the accesses that can be made on backend relational data by requiring some attributes to be given as input of the service. The accesses provide lookups on the ...

research-article

Parallel evaluation of conjunctive queries

Pages 223–234https://doi.org/10.1145/1989284.1989310

The availability of large data centers with tens of thousands of servers has led to the popular adoption of massive parallelism for data analysis on large datasets. Several query languages exist for running queries on massively parallel architectures, ...

TUTORIAL SESSION: Tutorial 1

tutorial

Querying semantic web data with SPARQL

Pages 305–316https://doi.org/10.1145/1989284.1989312

The Semantic Web is the initiative of the W3C to make information on the Web readable not only by humans but also by machines. RDF is the data model for Semantic Web data, and SPARQL is the standard query language for this data model. In the last ten ...

TUTORIAL SESSION: Tutorial 2

research-article

Theory of data stream computing: where to go

S. Muthukrishnan

Pages 317–319https://doi.org/10.1145/1989284.1989314

Computing power has been growing steadily, just as communication rate and memory size. Simultaneously our ability to create data has been growing phenomenally and therefore the need to analyze it. We now have examples of massive data streams that are ...

SESSION: Semistructured data & XML

research-article

The complexity of text-preserving XML transformations

Pages 247–258https://doi.org/10.1145/1989284.1989316

While XML is nowadays adopted as the de facto standard for data exchange, historically, its predecessor SGML was invented for describing electronic documents, i.e., marked up text. Actually, today there are still large volumes of such XML texts. We ...

research-article

Efficient evaluation for a temporal logic on changing XML documents

Pages 259–270https://doi.org/10.1145/1989284.1989317

We consider a sequence t₁,...,t_k of XML documents that is produced by a sequence of local edit operations. To describe properties of such a sequence, we use a temporal logic. The logic can navigate both in time and in the document, e.g. a formula can ...

research-article

Finding a minimal tree pattern under neighborhood constraints

Pages 235–246https://doi.org/10.1145/1989284.1989318

Tools that automatically generate queries are useful when schemas are hard to understand due to size or complexity. Usually, these tools find minimal tree patterns that contain a given set (or bag) of labels. The labels could be, for example, XML tags ...

SESSION: Rule-based query languages

research-article

A rule-based language for web data management

Pages 293–304https://doi.org/10.1145/1989284.1989320

There is a new trend to use Datalog-style rule-based languages to specify modern distributed applications, notably on the Web. We introduce here such a language for a distributed data model where peers exchange messages (i.e. logical facts) as well as ...

research-article

Relational transducers for declarative networking

Pages 283–292https://doi.org/10.1145/1989284.1989321

Motivated by a recent conjecture concerning the expressiveness of declarative networking, we propose a formal computation model for "eventually consistent" distributed querying, based on relational transducers. A tight link has been conjectured between ...

research-article

Rewrite rules for search database systems

Pages 271–282https://doi.org/10.1145/1989284.1989322

The results of a search engine can be improved by consulting auxiliary data. In a search database system, the association between the user query and the auxiliary data is driven by rewrite rules that augment the user query with a set of alternative ...

Cited By

Kifer M and Liu Y (2018). Declarative Logic Programming, 10.1145/3191315, Online publication date: 1-Sep-2018.

Contributors

Maurizio Lenzerini
Sapienza University of Rome
- Publication Years1981 - 2023
- Publication counts169
- Citation count5,744
- Available for Download37
- Downloads (cumulative)38,644
- Downloads (12 months)3,791
- Downloads (6 weeks)401
- Average Downloads per Article1,044
- Average Citation per Article34
View Full Profile
Thomas Schwentick
Technical University Dortmund
- Publication Years1994 - 2021
- Publication counts107
- Citation count2,227
- Available for Download42
- Downloads (cumulative)27,475
- Downloads (12 months)1,312
- Downloads (6 weeks)200
- Average Downloads per Article654
- Average Citation per Article21
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

PODS '11 Paper Acceptance Rate 25 of 113 submissions, 22%;

Overall Acceptance Rate 642 of 2,707 submissions, 24%

Year	Submitted	Accepted	Rate
PODS '19	87	29	33%
PODS '17	101	29	29%
PODS '16	94	31	33%
PODS '15	80	25	31%
PODS '14	67	22	33%
PODS '13	97	24	25%
PODS '12	101	26	26%
PODS '11	113	25	22%
PODS '10	113	27	24%
PODS '09	97	26	27%
PODS '08	159	28	18%
PODS '07	187	28	15%
PODS '06	185	35	19%
PODS '03	136	27	20%
PODS '02	109	24	22%
PODS '01	99	26	26%
PODS '00	119	26	22%
PODS '99	116	32	28%
PODS '98	119	28	24%
PODS '97	118	23	19%
PODS '96	84	22	26%
PODS '95	94	25	27%
PODS '94	117	28	24%
PODS '93	115	26	23%
Overall	2,707	642	24%

PODS

Sections

Proceeding Downloads

PODS 30th Anniversary Colloquium

A quest for beauty and wealth (or, business processes for database researchers)

Get the most out of your sample: optimal unbiased estimators using partial information

Tight bounds for Lp samplers, finding duplicates in streams, and related problems

Pan-private algorithms via statistics on sketches

FIFO indexes for decomposable problems

Data exchange beyond complete data

Incomplete information and certain answers in general data models

Determining the currency of data

New results on two-dimensional orthogonal range aggregation in external memory

On finding skylines in external memory

Beyond simple aggregates: indexing for summary queries

Space-efficient substring occurrence estimation

Provenance for aggregate queries

On provenance minimization

On the complexity of privacy-preserving complex event processing

Provenance views for module privacy

Querying graph patterns

Maximizing conjunctive views in deletion propagation

Determining relevance of accesses at runtime

Parallel evaluation of conjunctive queries

Querying semantic web data with SPARQL

Theory of data stream computing: where to go

The complexity of text-preserving XML transformations

Efficient evaluation for a temporal logic on changing XML documents

Finding a minimal tree pattern under neighborhood constraints

A rule-based language for web data management

Relational transducers for declarative networking

Rewrite rules for search database systems

Cited By

PODS '14: Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

PODS '09: Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

PODS '03: Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

Acceptance Rates