It is our great pleasure to welcome you to the 2011 ACM Symposium on Principles of Database Systems -- PODS'11. This year's symposium continues its tradition of being the premier international conference on the theoretical aspects of data management. PODS papers are distinguished by a rigorous approach to widely diverse problems in databases, often bringing to bear techniques from a variety of different areas, including computational logic, finite model theory, computational complexity, algorithm design and analysis, programming languages, and artificial intelligence. The first PODS conference was held in Los Angeles (CA) in 1982, with Jeffrey D. Ullman as General Chair. Since that time, virtually all new ideas, methods and techniques for data management have been investigated and presented in subsequent PODS conferences (see http://www09.sigmod.org/sigmod/pods/ for various information on the conference series).
To celebrate the 30th anniversary of the Symposium, the PODS Executive Committee has organized the PODS 30th Anniversary Colloquium, a special event held in June 12, 2011, with the goal of providing a retrospective on the role of database theory, and outlining a picture of the future directions of the discipline. The Colloquium featured five invited presentations from distinguished leaders in the field, namely:
Moshe Y. Vardi: "The rise, fall, and rise of dependency theory: Part 1, the rise and fall", Ronald Fagin: "The rise, fall, and rise of dependency theory: Part 2, the rise from the ashes", Jeffrey D. Ullman: "Deductive Databases",
Serge Abiteboul: "Trees, semistructured data, and other strange ways to go beyond tables", Victor Vianu: "Database Theory: Back to the Future".
The PODS Executive Committee is grateful to the speakers for their participation in the event, and to Frank Neven for organizing a lively discussion session ending the Colloquium.
This volume contains the proceedings of the Thirtieth ACMSIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2011), held in Athens, Greece, on June 13-15, 2011, in conjunction with the 2011 ACM SIGMOD International Conference on Management of Data.
The proceedings include a paper by Daniel Deutch and Tova Milo based on the keynote address by Tova Milo and two papers, the first by Marcelo Arenas and Jorge Pérez, based on the tutorial byMarcelo Arenas, and the second based on the tutorial by S. Muthu Muthukrishnan, and 25 contributed papers that were selected by the Program Committee from 113 submissions. Most of these papers are preliminary reports on work in progress. While they have been read by program committee members, they have not been formally refereed. Many of them will probably appear in more polished and detailed form in scientific journals.
The program committee selected the paper Data Exchange beyond Complete Data by Marcelo Arenas, Jorge Pérez and Juan L. Reutter for the PODS 2011 Best Paper Award. In addition, the announcement of the 2011 ACM PODS Alberto O. Mendelzon Test-of-Time Award appears in the proceedings. This year, the award is given to Optimal Aggregation Algorithms for Middleware by Ronald Fagin, Amnon Lotem, and Moni Naor. The paper originally appeared in the proceedings of PODS 2001. Warmest congratulations to the authors of these papers.
Proceeding Downloads
A quest for beauty and wealth (or, business processes for database researchers)
While classic data management focuses on the data itself, research on Business Processes considers also the context in which this data is generated and manipulated, namely the processes, the users, and the goals that this data serves. This allows the ...
Get the most out of your sample: optimal unbiased estimators using partial information
Random sampling is an essential tool in the processing and transmission of data. It is used to summarize data too large to store or manipulate and meet resource constraints on bandwidth or battery power. Estimators that are applied to the sample ...
Tight bounds for Lp samplers, finding duplicates in streams, and related problems
In this paper, we present near-optimal space bounds for Lp-samplers. Given a stream of updates (additions and subtraction) to the coordinates of an underlying vector x in Rn, a perfect Lp sampler outputs the i-th coordinate with probability xipxpp. In ...
Pan-private algorithms via statistics on sketches
Consider fully dynamic data, where we track data as it gets inserted and deleted. There are well developed notions of private data analyses with dynamic data, for example, using differential privacy. We want to go beyond privacy, and consider privacy ...
FIFO indexes for decomposable problems
This paper studies first-in-first-out (FIFO) indexes, each of which manages a dataset where objects are deleted in the same order as their insertions. We give a technique that converts a static data structure to a FIFO index for all decomposable ...
Data exchange beyond complete data
In the traditional data exchange setting, source instances are restricted to be complete in the sense that every fact is either true or false in these instances. Although natural for a typical database translation scenario, this restriction is gradually ...
Incomplete information and certain answers in general data models
While incomplete information is ubiquitous in all data models - especially in applications involving data translation or integration - our understanding of it is still not completely satisfactory. For example, even such a basic notion as certain answers ...
Determining the currency of data
Data in real-life databases become obsolete rapidly. One often finds that multiple values of the same entity reside in a database. While all of these values were once correct, most of them may have become stale and inaccurate. Worse still, the values ...
New results on two-dimensional orthogonal range aggregation in external memory
We consider the orthogonal range aggregation problem. The dataset S consists of N axis-parallel rectangles in R2, each of which is associated with an integer weight. Given an axis-parallel rectangle Q and an aggregate function F, a query reports the ...
On finding skylines in external memory
We consider the skyline problem (a.k.a. the maxima problem), which has been extensively studied in the database community. The input is a set P of d-dimensional points. A point dominates another if the former has a lower coordinate than the latter on ...
Beyond simple aggregates: indexing for summary queries
Database queries can be broadly classified into two categories: reporting queries and aggregation queries. The former retrieves a collection of records from the database that match the query's conditions, while the latter returns an aggregate, such as ...
Space-efficient substring occurrence estimation
We study the problem of estimating the number of occurrences of substrings in textual data: A text T on some alphabet £ of size à is preprocessed and an index I is built. The index is used in lieu of the text to answer queries of the form CountH(P), ...
Provenance for aggregate queries
We study in this paper provenance information for queries with aggregation. Provenance information was studied in the context of various query languages that do not allow for aggregation, and recent work has suggested to capture provenance by annotating ...
On provenance minimization
Provenance information has been proved to be very effective in capturing the computational process performed by queries, and has been used extensively as the input to many advanced data management tools (e.g. view maintenance, trust assessment, or query ...
On the complexity of privacy-preserving complex event processing
Complex Event Processing (CEP) Systems are stream processing systems that monitor incoming event streams in search of userspecified event patterns. While CEP systems have been adopted in a variety of applications, the privacy implications of event ...
Provenance views for module privacy
Scientific workflow systems increasingly store provenance information about the module executions used to produce a data item, as well as the parameter settings and intermediate data items passed between module executions. However, authors/owners of ...
Querying graph patterns
Graph data appears in a variety of application domains, and many uses of it, such as querying, matching, and transforming data, naturally result in incompletely specified graph data, i.e., graph patterns. While queries need to be posed against such data,...
Maximizing conjunctive views in deletion propagation
In deletion propagation, tuples from the database are deleted in order to reflect the deletion of a tuple from the view. Such an operation may result in the (often necessary) deletion of additional tuples from the view, besides the intentionally deleted ...
Determining relevance of accesses at runtime
Consider the situation where a query is to be answered using Web sources that restrict the accesses that can be made on backend relational data by requiring some attributes to be given as input of the service. The accesses provide lookups on the ...
Parallel evaluation of conjunctive queries
The availability of large data centers with tens of thousands of servers has led to the popular adoption of massive parallelism for data analysis on large datasets. Several query languages exist for running queries on massively parallel architectures, ...
Querying semantic web data with SPARQL
The Semantic Web is the initiative of the W3C to make information on the Web readable not only by humans but also by machines. RDF is the data model for Semantic Web data, and SPARQL is the standard query language for this data model. In the last ten ...
Theory of data stream computing: where to go
Computing power has been growing steadily, just as communication rate and memory size. Simultaneously our ability to create data has been growing phenomenally and therefore the need to analyze it. We now have examples of massive data streams that are ...
The complexity of text-preserving XML transformations
While XML is nowadays adopted as the de facto standard for data exchange, historically, its predecessor SGML was invented for describing electronic documents, i.e., marked up text. Actually, today there are still large volumes of such XML texts. We ...
Efficient evaluation for a temporal logic on changing XML documents
We consider a sequence t1,...,tk of XML documents that is produced by a sequence of local edit operations. To describe properties of such a sequence, we use a temporal logic. The logic can navigate both in time and in the document, e.g. a formula can ...
Finding a minimal tree pattern under neighborhood constraints
Tools that automatically generate queries are useful when schemas are hard to understand due to size or complexity. Usually, these tools find minimal tree patterns that contain a given set (or bag) of labels. The labels could be, for example, XML tags ...
A rule-based language for web data management
There is a new trend to use Datalog-style rule-based languages to specify modern distributed applications, notably on the Web. We introduce here such a language for a distributed data model where peers exchange messages (i.e. logical facts) as well as ...
Relational transducers for declarative networking
Motivated by a recent conjecture concerning the expressiveness of declarative networking, we propose a formal computation model for "eventually consistent" distributed querying, based on relational transducers. A tight link has been conjectured between ...
Rewrite rules for search database systems
The results of a search engine can be improved by consulting auxiliary data. In a search database system, the association between the user query and the auxiliary data is driven by rewrite rules that augment the user query with a set of alternative ...
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
PODS '19 | 87 | 29 | 33% |
PODS '17 | 101 | 29 | 29% |
PODS '16 | 94 | 31 | 33% |
PODS '15 | 80 | 25 | 31% |
PODS '14 | 67 | 22 | 33% |
PODS '13 | 97 | 24 | 25% |
PODS '12 | 101 | 26 | 26% |
PODS '11 | 113 | 25 | 22% |
PODS '10 | 113 | 27 | 24% |
PODS '09 | 97 | 26 | 27% |
PODS '08 | 159 | 28 | 18% |
PODS '07 | 187 | 28 | 15% |
PODS '06 | 185 | 35 | 19% |
PODS '03 | 136 | 27 | 20% |
PODS '02 | 109 | 24 | 22% |
PODS '01 | 99 | 26 | 26% |
PODS '00 | 119 | 26 | 22% |
PODS '99 | 116 | 32 | 28% |
PODS '98 | 119 | 28 | 24% |
PODS '97 | 118 | 23 | 19% |
PODS '96 | 84 | 22 | 26% |
PODS '95 | 94 | 25 | 27% |
PODS '94 | 117 | 28 | 24% |
PODS '93 | 115 | 26 | 23% |
Overall | 2,707 | 642 | 24% |