Abstract
This chapter gives an overview of the current state of the Sindbad system and its planned extensions. Following an introduction to the system and its query language SiQL, we present application scenarios from the areas of gene expression/regulation and small molecules. Next, we describe a web service interface to Sindbad that opens up new possibilities for inductive databases (distributing tasks over multiple servers, language and platform independence, …). Finally, we discuss future plans for the system: making it more ‘declarative’ through the use of signatures, integrating the concept of mining views, and supporting specific pattern domains such as graphs and strings.
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Wicker, J., Richter, L., Kramer, S. (2010). SINDBAD and SiQL: Overview, Applications and Future Developments. In: Džeroski, S., Goethals, B., Panov, P. (eds) Inductive Databases and Constraint-Based Data Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7738-0_12
DOI: https://doi.org/10.1007/978-1-4419-7738-0_12
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-7737-3
Online ISBN: 978-1-4419-7738-0
eBook Packages: Computer Science, Computer Science (R0)