For over two decades, the International Conference on Management of Data (COMAD), modeled along the lines of ACM SIGMOD, has been the premier international database conference hosted in India by Division II of Computer Society of India. The first COMAD was held in 1989, and it has been held on a nearly annual basis since then (except for a few breaks such as in the years when VLDB and ICDE were held in India). COMAD has always had a significant international participation, with about 30% of the papers being from outside India, including Europe, USA and East/South-East Asia.
This year, COMAD 2012, the 18th version of the conference, is being held in Pune from December 14, 2012 to December 16, 2012.
Proceeding Downloads
Reimagining textbooks through the data lens
Textbooks are the primary vehicles for delivering subject knowledge to the students and are known to be the educational input most consistently associated with improvements in student learning. With the emergence of abundant online content, cloud ...
User activity analytics on the social web of news
The proliferation of social media is undoubtedly changing the way people produce and consume news online. Editors and publishers in newsrooms need to understand user engagement and audience sentiment evolution on various news topics. News consumers want ...
Building knowledge bases from the web
The web is a vast repository of human knowledge. Extracting structured data from web pages can enable applications like comparison shopping, and lead to improved ranking and rendering of search results. In this talk, I will describe two efforts to ...
Spatio-temporal indexing: current scenario, challenges and approaches
With rapid advancements in computing hardware, tracking devices such as GPS receivers and sensors have become pervasive, generating a large amount of spatio-temporal data, such as measurements of temperature, pressure, air quality, traffic, etc. using ...
Big data technologies circa 2012
The growth of the World Wide Web has led to an astronomical amount of data being generated. More recently, the amount of user-generated content has seen tremendous expansion thanks to social media like Facebook and Twitter. Enterprises, researchers, and ...
Markov logic networks: theory, algorithms and applications
Most real world problems are characterized by relational structure i.e. entities and relationships between them. Further, they are inherently uncertain in nature. Theory of logic gives the framework to represent relations. Statistics provides the tools ...
Reliability aware data fusion
Due to ubiquitous sensors (GPS, Accelerometer), easy of use apps (Facebook, Twitter etc), presence of audio & video recording devices and higher internet connectivity, the key characteristics of raw data is changing. This new data can be characterized ...
Connectivity-tolerant query optimization over distributed mobile repositories
Query processing and optimization in centralized and distributed environments is well-researched. Centralized query optimization focused on minimizing the number of input/output (or I/O) from disk. Distributed query processing focused mainly on ...
Context aware ontology based information extraction
We have developed an ontology based information extraction system where property and relation name occurrences are used to identify domain entities using patterns written in terms of dependency relations. Our key intuition is that, with respect to a ...
REBOM: recovery of blocks of missing values in time series
The recovery of blocks of missing values in regular time series has been addressed by model-based techniques. Such techniques are not suitable to recover blocks of missing values in irregular time series and restore peaks and valley. We propose REBOM (...
A novel query-based approach for addressing summarizability issues in XOLAP
The business intelligence and decision-support systems used in many application domains casually rely on data warehouses, which are decision-oriented data repositories modeled as multidimensional (MD) structures. MD structures help navigate data through ...
Hybrid HBase: leveraging flash SSDs to improve cost per throughput of HBase
Column-oriented data stores, such as BigTable and HBase, have successfully paved the way for managing large key-value datasets with random accesses. At the same time, the declining cost of flash SSDs have enabled their use in several applications ...
Entity ranking and relationship queries using an extended graph model
There is a large amount of textual data on the Web and in Wikipedia, where mentions of entities (such as Gandhi) are annotated with a link to the disambiguated entity (such as M. K. Gandhi). Such annotation may have been done manually (as in Wikipedia) ...
Towards efficient discovery of frequent patterns with relative support
Frequent patterns are an important class of regularities that exist in a database. Although there exists no universally acceptable best measure to assess the interestingness of a pattern, relative support is emerging as a popular measure to discover ...
Excel solvers for the traveling salesman problem
Ordering queries within a workload and ordering joins in a query are important problems in databases [1]. We give algorithms for the query sequencing problem that scale (small space) and are efficient (low runtime) as compared to earlier work [4]. The ...
A lightweight distributed order and duplication insensitive algorithm for approximate top-k queries using order statistics
Let {e1, e2,..., el} be a set of distinct records in a database, with unique IDs {id1, id2,..., idl}. Let A1, A2,..., Ap be a set of distinct attributes for each record. For every record ei, the attribute Aj is zero or some positive value. We denote the ...
Who's who: linking user's multiple identities on online social media
On online social media, users join new online social networks (OSNs) to exploit variety of services while maintaining their old identities on other OSNs. A user maintains an identity on each OSN mentioning metadata (e.g. profile information) about her. ...
MODETL: a complete MODeling and ETL method for designing data warehouses from semantic databases
In last decades, Semantic DataBases (SDB) have emerged and the major DBMS editors provide semantic support in their products. This is mainly due to the spectacular development of ontologies in several important domains like E-commerce, Engineering, ...
Web personalization and recommender systems: an overview
Information overload is the major problem of today's Internet use. User frequently gets much more information than needed. Also much of the information which the user gets is less relevant and very few links, items, or contents are really useful. To get ...
Efficient approximate dictionary matching
Named entity recognition (NER) systems are important for extracting useful information from unstructured data sources. It is known that large domain dictionaries help in improving extraction performance of NER. Unstructured text usually contains entity ...