Abstract
In era of Big Data different applications face the problem of dealing with incomplete data. In the presence of incomplete databases, certain answers are a principled semantics of query answering. Unfortunately, the computation of certain query answers is a coNP-hard problem. To make query answering feasible in practice, recent research has focused on developing polynomial time algorithms computing a sound (but possibly incomplete) set of certain answers. In this chapter, we discuss several recently proposed approximation algorithms, along with a system prototype implementing them and experimental evaluation. The central tools are conditional tables and the conditional evaluation of relation algebra. Different evaluation strategies can be applied, with more accurate ones having higher complexity, but returning more certain answers, thereby enabling users to choose the technique that best meets their needs in terms of balance between efficiency and quality of the results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Transaction Processing Performance Council. TPC benchmark h standard specification (2014). Revision 2.17.1. (TPC-H)
Abiteboul, S., Grahne, G.: Update semantics for incomplete databases. In: Proceedings of Very Large Data Bases (VLDB) Conference, pp. 1–12 (1985)
Arenas, M., Bertossi, L.E., Chomicki, J.: Consistent query answers in inconsistent databases. In: Proceedings of the Symposium on Principles of Database Systems (PODS), pp. 68–79 (1999)
Bertossi, L.E.: Database Repairing and Consistent Query Answering. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, San Rafael (2011)
Bienvenu, M., Ortiz, M.: Ontology-mediated query answering with data-tractable description logics. In: Reasoning Web, pp. 218–307 (2015)
Calautti, M., Greco, S., Molinaro, C., Trubitsyna, I.: Exploiting equality generating dependencies in checking chase termination. PVLDB 9(5), 396–407 (2016)
Calautti, M., Greco, S., Molinaro, C., Trubitsyna, I.: Using linear constraints for logic program termination analysis. TPLP 16(3), 353–377 (2016)
Calautti, M., Greco, S., Spezzano, F., Trubitsyna, I.: Checking termination of bottom-up evaluation of logic programs with function symbols. TPLP 15(6), 854–889 (2015)
Calautti, M., Greco, S., Trubitsyna, I.: Detecting decidable classes of finitely ground logic programs with function symbols. ACM Trans. Comput. Log. 18(4), 28:1–28:42 (2017)
Calì, A., Gottlob, G., Lukasiewicz, T.: A general datalog-based framework for tractable query answering over ontologies. J. Web Semant. 14, 57–83 (2012)
Caroprese, L., Trubitsyna, I., Truszczynski, M., Zumpano, E.: The view-update problem for indefinite databases. In: Logics in Artificial Intelligence - 13th European Conference, JELIA 2012, Toulouse, France, September 26–28, 2012. Proceedings. pp. 134–146 (2012)
Caroprese, L., Trubitsyna, I., Truszczynski, M., Zumpano, E.: A measure of arbitrariness in abductive explanations. TPLP 14(4–5), 665–679 (2014)
Console, M., Guagliardo, P., Libkin, L.: Approximations and refinements of certain answers via many-valued logics. In: Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning (KR), pp. 349–358 (2016)
De Giacomo, G., Lembo, D., Lenzerini, M., Rosati, R.: On reconciling data exchange, data integration, and peer data management. In: Proceedings of the Symposium on Principles of Database Systems (PODS), pp. 133–142 (2007)
Fiorentino, N., Greco, S., Molinaro, C., Trubitsyna, I.: ACID: A system for computing approximate certain query answers over incomplete databases. In: Proceedings of the International Conference on Management of Data (SIGMOD), pp. 1685–1688 (2018)
Furfaro, F., Greco, S., Molinaro, C.: A three-valued semantics for querying and repairing inconsistent databases. Ann. Math. Artif. Intell. 51(2–4), 167–193 (2007)
Grahne, G.: The Problem of Incomplete Information in Relational Databases. Lecture Notes in Computer Science, vol. 554. Springer, Berlin (1991)
Greco, S., Molinaro, C., Spezzano, F.: Incomplete Data and Data Dependencies in Relational Databases. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, San Rafael (2012)
Greco, S., Molinaro, C., Trubitsyna, I.: Computing approximate certain answers over incomplete databases. In: Proceedings of the Alberto Mendelzon International Workshop on Foundations of Data Management and the Web (AMW) (2017)
Greco, S., Molinaro, C., Trubitsyna, I.: Approximation algorithms for querying incomplete databases. Inf. Syst. 86, 28–45 (2019). https://doi.org/10.1016/j.is.2019.03.010
Greco, S., Spezzano, F., Trubitsyna, I.: Checking chase termination: Cyclicity analysis and rewriting techniques. IEEE Trans. Knowl. Data Eng. 27(3), 621–635 (2015)
Guagliardo, P., Libkin, L.: Making SQL queries correct on incomplete databases: A feasibility study. In: Proceedings of the Symposium on Principles of Database Systems (PODS), pp. 211–223 (2016)
Imielinski, T., Lipski, Jr., W.: Incomplete information in relational databases. J. ACM 31(4), 761–791 (1984)
Koutris, P., Wijsen, J.: The data complexity of consistent query answering for self-join-free conjunctive queries under primary key constraints. In: Proceeding of the Symposium on Principles of Database Systems (PODS), pp. 17–29 (2015)
Lenzerini, M.: Data integration: a theoretical perspective. In: Proceedings of the Symposium on Principles of Database Systems (PODS), pp. 233–246 (2002)
Libkin, L.: How to define certain answers. In: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 4282–4288 (2015)
Libkin, L.: SQL’s three-valued logic and certain answers. ACM Trans. Database Syst. 41(1), 1 (2016)
Lipski, W.: On relational algebra with marked nulls. In: Proceedings of the Symposium on Principles of Database Systems (PODS), pp. 201–203 (1984)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Fiorentino, N., Molinaro, C., Trubitsyna, I. (2020). Approximate Query Answering over Incomplete Data. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds) Complex Pattern Mining. Studies in Computational Intelligence, vol 880. Springer, Cham. https://doi.org/10.1007/978-3-030-36617-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-36617-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36616-2
Online ISBN: 978-3-030-36617-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)