[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

MLog: towards declarative in-database machine learning

Published: 01 August 2017 Publication History

Abstract

We demonstrate MLog, a high-level language that integrates machine learning into data management systems. Unlike existing machine learning frameworks (e.g., TensorFlow, Theano, and Caffe), MLog is declarative, in the sense that the system manages all data movement, data persistency, and machine-learning related optimizations (such as data batching) automatically. Our interactive demonstration will show audience how this is achieved based on the novel notion of tensoral views (TViews), which are similar to relational views but operate over tensors with linear algebra. With MLog, users can succinctly specify not only simple models such as SVM (in just two lines), but also sophisticated deep learning models that are not supported by existing in-database analytics systems (e.g., MADlib, PAL, and SciDB), as a series of cascaded TViews. Given the declarative nature of MLog, we further demonstrate how query/program optimization techniques can be leveraged to translate MLog programs into native TensorFlow programs. The performance of the automatically generated Tensor-Flow programs is comparable to that of hand-optimized ones.

References

[1]
P. G. Brown. Overview of scidb: Large scale array storage, processing and analysis. In SIGMOD, 2010.
[2]
S. Cohen, W. Nutt, and Y. Sagiv. Deciding equivalences among conjunctive aggregate queries. J. ACM, 2007.
[3]
S. Hadjis, F. Abuzaid, C. Zhang, and C. Ré. Caffe Con Troll: Shallow ideas to speed up deep learning. In DanaC, 2015.
[4]
J. M. Hellerstein et al. The MADlib analytics library: Or mad skills, the sql. Proc. VLDB Endow., 2012.
[5]
J. Jiang, B. Cui, C. Zhang, and L. Yu. Heterogeneity-aware distributed parameter servers. In SIGMOD. ACM, 2017.
[6]
J. Jiang, L. Yu, J. Jiang, Y. Liu, and B. Cui. Angel: a new large-scale machine learning system. National Science Review, 2017.
[7]
A. Kumar, J. Naughton, and J. M. Patel. Learning generalized linear models over normalized data. In SIGMOD, 2015.
[8]
M. Lin, Q. Chen, and S. Yan. Network In Network. ICLR, 2014.
[9]
J. MacGregor. Predictive Analysis with SAP: The Comprehensive Guide. SAP PRESS, 2013.
[10]
M. Schleich, D. Olteanu, and R. Ciucanu. Learning linear regression models over factorized joins. In SIGMOD, 2016.
[11]
J. Seib and G. Lausen. Parallelizing Datalog programs by generalized pivoting. In PODS, 1991.

Cited By

View all
  • (2024)Database Native Model Selection: Harnessing Deep Neural Networks in Database SystemsProceedings of the VLDB Endowment10.14778/3641204.364121217:5(1020-1033)Online publication date: 2-May-2024
  • (2024)Pre-LogMGAE: Identification of Log Anomalies Using a Pre-Trained Masked Graph Autoencoder2024 43rd International Symposium on Reliable Distributed Systems (SRDS)10.1109/SRDS64841.2024.00036(294-306)Online publication date: 30-Sep-2024
  • (2023)JoinBoost: Grow Trees over Normalized Data Using Only SQLProceedings of the VLDB Endowment10.14778/3611479.361150916:11(3071-3084)Online publication date: 1-Jul-2023
  • Show More Cited By
  1. MLog: towards declarative in-database machine learning

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 10, Issue 12
    August 2017
    427 pages
    ISSN:2150-8097
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 August 2017
    Published in PVLDB Volume 10, Issue 12

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)71
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Database Native Model Selection: Harnessing Deep Neural Networks in Database SystemsProceedings of the VLDB Endowment10.14778/3641204.364121217:5(1020-1033)Online publication date: 2-May-2024
    • (2024)Pre-LogMGAE: Identification of Log Anomalies Using a Pre-Trained Masked Graph Autoencoder2024 43rd International Symposium on Reliable Distributed Systems (SRDS)10.1109/SRDS64841.2024.00036(294-306)Online publication date: 30-Sep-2024
    • (2023)JoinBoost: Grow Trees over Normalized Data Using Only SQLProceedings of the VLDB Endowment10.14778/3611479.361150916:11(3071-3084)Online publication date: 1-Jul-2023
    • (2023)On Higher-performance Sparse Tensor Transposition2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW59300.2023.00118(697-701)Online publication date: May-2023
    • (2023)The Robot@Home2 dataset: A new release with improved usability toolsSoftwareX10.1016/j.softx.2023.10149023(101490)Online publication date: Jul-2023
    • (2023)Towards an Integrated Rough Set and Data Modelling Framework for Data Management and Knowledge ExtractionArtificial Intelligence and Smart Environment10.1007/978-3-031-26254-8_116(800-805)Online publication date: 8-Mar-2023
    • (2022)ConnectorXProceedings of the VLDB Endowment10.14778/3551793.355184715:11(2994-3003)Online publication date: 1-Jul-2022
    • (2022)ReACTProceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1145/3559009.3569685(1-13)Online publication date: 8-Oct-2022
    • (2022)Database Meets Artificial Intelligence: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.299464134:3(1096-1116)Online publication date: 1-Mar-2022
    • (2022)A Comparative Study of in-Database Inference Approaches2022 IEEE 38th International Conference on Data Engineering (ICDE)10.1109/ICDE53745.2022.00180(1794-1807)Online publication date: May-2022
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media