
Convergence time analysis of Asynchronous Distributed Artificial Neural Networks

Published: 08 January 2022

Abstract

Artificial Neural Networks (ANNs) have drawn academic and industry attention for their ability to represent and solve complex problems. Researchers are studying how to distribute their computation to reduce training time. However, the most common approaches in this direction are synchronous, leaving computational resources underutilized. Asynchronous training does not have this drawback, but it is affected by stale gradient updates, which have not yet been extensively researched. Considering this, we experimentally investigate how stale gradients affect the convergence time and loss value of an ANN. In particular, we analyze an asynchronous distributed implementation of a Word2Vec model, in which the impact of staleness is negligible compared with the computational speedup achieved by allowing it.
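As a toy illustration of the staleness the abstract refers to, the sketch below runs a Hogwild-style lock-free update loop on a simple quadratic objective. This is a hypothetical example under stated assumptions, not the paper's Word2Vec setup; all names (`async_sgd`, `make_grad`, the hyperparameters) are made up for illustration. Each worker reads a snapshot of the shared parameters, computes a gradient that may already be stale by write time, and writes the update back without any lock.

```python
import threading
import numpy as np

def make_grad(target):
    """Gradient of the toy loss ||w - target||^2."""
    def grad(w):
        return 2.0 * (w - target)
    return grad

def async_sgd(n_workers=4, steps_per_worker=500, lr=0.01, dim=8, seed=0):
    rng = np.random.default_rng(seed)
    target = rng.normal(size=dim)
    w = rng.normal(size=dim)        # shared parameters, updated without a lock
    grad = make_grad(target)

    def worker():
        for _ in range(steps_per_worker):
            g = grad(w.copy())      # gradient from a possibly stale snapshot
            w[:] -= lr * g          # in-place write to the shared array (racy by design)

    threads = [threading.Thread(target=worker) for _ in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return float(np.sum((w - target) ** 2))

final_loss = async_sgd()
print(final_loss)
```

On this convex toy problem the loss still shrinks toward zero even though individual updates are computed from outdated parameters, which mirrors the abstract's observation that mild staleness can be tolerated in exchange for keeping all workers busy. (Note that in CPython the GIL serializes the bytecode, so this sketch models staleness conceptually rather than reproducing true parallel hardware races.)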


Cited By

View all
  • (2022) Targeting a light-weight and multi-channel approach for distributed stream processing. Journal of Parallel and Distributed Computing 167:C (77-96). https://doi.org/10.1016/j.jpdc.2022.04.022. Online publication date: 1-Sep-2022.


    Published In

    CODS-COMAD '22: Proceedings of the 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD). January 2022, 357 pages.

    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery, New York, NY, United States


    Qualifiers

    • Extended-abstract
    • Research
    • Refereed limited

    Conference

    CODS-COMAD 2022


    Article Metrics

    • Downloads (last 12 months): 15
    • Downloads (last 6 weeks): 2
    Reflects downloads up to 11 Dec 2024

