AutoShard: Automated Embedding Table Sharding for Recommender Systems

Authors:

Xia Hu

KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 4461 - 4471

https://doi.org/10.1145/3534678.3539034

Published: 14 August 2022

Abstract

Embedding learning is an important technique in deep recommendation models for mapping categorical features to dense vectors. However, embedding tables often demand an extremely large number of parameters, which makes them storage and efficiency bottlenecks. Distributed training solutions have been adopted to partition the embedding tables across multiple devices, but the tables can easily lead to load imbalances if not carefully partitioned. This is a significant design challenge in distributed systems known as embedding table sharding, i.e., how to partition the embedding tables so that costs are balanced across devices. The task is non-trivial because 1) it is hard to measure the cost efficiently and precisely, and 2) the partition problem is known to be NP-hard. In this work, we introduce our novel practice at Meta, namely AutoShard, which uses a neural cost model to directly predict multi-table costs and leverages deep reinforcement learning to solve the partition problem. Experimental results on an open-sourced large-scale synthetic dataset and Meta's production dataset demonstrate the superiority of AutoShard over heuristics. Moreover, the learned policy of AutoShard can transfer to sharding tasks with various numbers of tables and different ratios of unseen tables without any fine-tuning. Furthermore, AutoShard can efficiently shard hundreds of tables in seconds. The effectiveness, transferability, and efficiency of AutoShard make it desirable for production use. Our algorithms have been deployed in Meta's production environment. A prototype is available at https://github.com/daochenzha/autoshard.
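
For intuition about the balancing objective, the sketch below shows one simple greedy heuristic of the general kind a learned approach would be compared against. It is not AutoShard's method, and it is not taken from the paper. It assumes each table has a single known scalar cost and that costs on a device simply add up, which is exactly the simplification the abstract says is hard to rely on in practice. The function name `greedy_shard` and the cost values are hypothetical.

```python
import heapq


def greedy_shard(table_costs, num_devices):
    """Assign tables to devices, largest cost first, always picking
    the device with the smallest accumulated cost so far.

    Assumption: per-table costs are known scalars and are additive.
    """
    # Min-heap of (accumulated_cost, device_index).
    heap = [(0.0, d) for d in range(num_devices)]
    heapq.heapify(heap)
    assignment = [[] for _ in range(num_devices)]

    # Visit table indices in order of decreasing cost.
    order = sorted(range(len(table_costs)),
                   key=lambda i: table_costs[i], reverse=True)
    for i in order:
        load, device = heapq.heappop(heap)
        assignment[device].append(i)
        heapq.heappush(heap, (load + table_costs[i], device))

    loads = [sum(table_costs[i] for i in tables) for tables in assignment]
    return assignment, loads


# Hypothetical per-table costs (arbitrary units), sharded over 2 devices.
costs = [4.0, 3.0, 3.0, 2.0, 2.0, 1.0]
assignment, loads = greedy_shard(costs, 2)
print(assignment, loads, "max device cost:", max(loads))
```

The quantity to minimize is the maximum per-device cost, since the slowest device determines the step time. A greedy rule like this gives no optimality guarantee, and it breaks down further when the cost of several tables placed together is not the sum of their individual costs. The abstract's neural cost model, which predicts multi-table costs directly, targets that gap.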

Supplemental Material

MP4 File

The video presents our novel practice at Meta, namely AutoShard, for embedding table sharding. We introduce how we achieve load balance by using a neural cost model to predict the embedding costs and leveraging deep reinforcement learning to solve the partition problem. A prototype is open-sourced at https://github.com/daochenzha/autoshard.
