[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3415958.3433054acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedesConference Proceedingsconference-collections
research-article

Machine Learning Pipeline for Reusing Pretrained Models

Published: 27 November 2020 Publication History

Abstract

Machine learning methods have proven to be effective in analyzing vast amounts of data in various formats to obtain patterns, detect trends, gain insight, and predict outcomes based on historical data. However, training models from scratch across various real-world applications is costly in terms of both time and data consumption. Model adaptation (Domain Adaptation) is a promising methodology to tackle this problem. It can reuse the knowledge embedded in an existing model to train another model. However, model adaptation is a challenging task due to dataset bias or domain shift. In addition, data access from both the original (source) domain and the destination (target) domain is often an issue in the real world, due to data privacy and cost issues (gathering additional data may cost money). Several domain adaptation algorithms and methodologies have introduced in recent years; they reuse trained models from one source domain for a different but related target domain. Many existing domain adaptation approaches aim at modifying the trained model structure or adjusting the latent space of the target domain using data from the source domain. Domain adaptation techniques can be evaluated over several criteria, namely, accuracy, knowledge transfer, training time, and budget. In this paper, we start from the notion that in many real-world scenarios, the owner of the trained model restricts access to the model structure and the source dataset. To solve this problem, we propose a methodology to efficiently select data from the target domain (minimizing consumption of target domain data) to adapt the existing model without accessing the source domain, while still achieving acceptable accuracy. Our approach is designed for supervised and semi-supervised learning and extendable to unsupervised learning.

References

[1]
Chen, Z. et al. 2019. Blending-target Domain Adaptation by Adversarial Meta-Adaptation Networks. CVPR. (2019), 2248--2257.
[2]
Csurka, G. 2017. Domain Adaptation for Visual Applications: A Comprehensive Survey. arXiv:1702.05374 (2017), 1--46.
[3]
Damodaran, B.B. et al. 2018. DeepJDOT: Deep Joint Distribution Optimal Transport for Unsupervised Domain Adaptation. ECCV (2018).
[4]
Daume III, H. 2007. Frustratingly Easy Domain Adaptation. Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. (2007).
[5]
Ditzler, G. et al. 2015. Learning in Nonstationary Environments: A Survey. IEEE Computational Intelligence Magazine. 10, 4 (2015), 12--25.
[6]
Ganin, Y. et al. 2016. Domain-Adversarial Training of Neural Networks. Journal of Machine Learning Research 17. 17, (2016), 1--35.
[7]
Grana, C. et al. 2013. Image classification with multivariate Gaussian descriptors. Proceedings of the 17th International Conference on Image Analysis and Processing. 8157 LNCS, PART 2 (2013), 111--120.
[8]
Hong, Y. et al. 2019. How Generative Adversarial Networks and Their Variants Work: An Overview. ACM Computing Surveys (CSUR). (2019).
[9]
Hosseini-asl, E. et al. 2019. Augmented cyclic adversarial learning for low resource domain adaptation. ICLR. (2019), 1--14.
[10]
Hu, L. et al. 2018. Duplex Generative Adversarial Network for Unsupervised Domain Adaptation. CVPR (2018), 1498--1507.
[11]
Kouw, W.M. and Loog, M. 2019. A review of domain adaptation without target labels. arXiv:1901.05335. (2019).
[12]
Liang, J. et al. 2019. Distant Supervised Centroid Shift: A Simple and Efficient Approach to Visual Domain Adaptation. CVPR (2019).
[13]
Mahadevan, S. et al. 2018. A Unified Framework for Domain Adaptation using Metric Learning on Manifolds. arXiv:1804.10834 (2018).
[14]
Motiian, S. et al. 2017. Few-Shot Adversarial Domain Adaptation arXiv: 1711. 02536v1 [cs. CV] 5 Nov 2017. NIPS'17 Proceedings of the 31st International Conference on Neural Information Processing Systems. (2017).
[15]
Motiian, S. et al. 2017. Unified Deep Supervised Domain Adaptation and Generalization. ICCV. (2017).
[16]
Pan, S.J. and Yang, Q. 2009. A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering (2009), 1--15.
[17]
Quionero-Candela, J. et al. 2009. Dataset Shift in Machine Learning.
[18]
Shen, J. et al. 2018. Wasserstein Distance Guided Representation Learning for Domain Adaptation. AAAI (2018).
[19]
Shu, R. et al. 2018. A DIRT-T Approach to Unsupervised Domain Adaptation. ICLR (2018).
[20]
Torralba, A. and Efros, A.A. 2011. Unbiased look at dataset bias. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2011), 1521 --1528.
[21]
Tzeng, E. et al. 2017. Adversarial Discriminative Domain Adaptation. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017).
[22]
Wang, M. and Deng, W. 2018. Deep Visual Domain Adaptation: A Survey. Neurocomputing. 312, (2018), 1--20.

Cited By

View all
  • (2022)Do Auto-Regressive Models Protect Privacy? Inferring Fine-Grained Energy Consumption From Aggregated Model ParametersIEEE Transactions on Services Computing10.1109/TSC.2021.310049815:6(3198-3209)Online publication date: 1-Nov-2022
  • (2022)Machine learning solutions in sewer systems: a bibliometric analysisUrban Water Journal10.1080/1573062X.2022.213846020:1(1-14)Online publication date: 28-Oct-2022
  • (2022)Prostate cancer classification from ultrasound and MRI images using deep learning based Explainable Artificial IntelligenceFuture Generation Computer Systems10.1016/j.future.2021.09.030127:C(462-472)Online publication date: 1-Feb-2022

Index Terms

  1. Machine Learning Pipeline for Reusing Pretrained Models

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    MEDES '20: Proceedings of the 12th International Conference on Management of Digital EcoSystems
    November 2020
    170 pages
    ISBN:9781450381154
    DOI:10.1145/3415958
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 November 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Domain Adaptation
    2. Knowledge Transfer
    3. Model Reuse
    4. Supervised Learning
    5. Transfer Learning

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    MEDES '20
    MEDES '20: 12th International Conference on Management of Digital EcoSystems
    November 2 - 4, 2020
    Virtual Event, United Arab Emirates

    Acceptance Rates

    MEDES '20 Paper Acceptance Rate 19 of 27 submissions, 70%;
    Overall Acceptance Rate 267 of 682 submissions, 39%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 01 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Do Auto-Regressive Models Protect Privacy? Inferring Fine-Grained Energy Consumption From Aggregated Model ParametersIEEE Transactions on Services Computing10.1109/TSC.2021.310049815:6(3198-3209)Online publication date: 1-Nov-2022
    • (2022)Machine learning solutions in sewer systems: a bibliometric analysisUrban Water Journal10.1080/1573062X.2022.213846020:1(1-14)Online publication date: 28-Oct-2022
    • (2022)Prostate cancer classification from ultrasound and MRI images using deep learning based Explainable Artificial IntelligenceFuture Generation Computer Systems10.1016/j.future.2021.09.030127:C(462-472)Online publication date: 1-Feb-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media