[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

E-Commerce Product Categorization via Machine Translation

Published: 21 July 2020 Publication History

Abstract

E-commerce platforms categorize their products into a multi-level taxonomy tree with thousands of leaf categories. Conventional methods for product categorization are typically based on machine learning classification algorithms. These algorithms take product information as input (e.g., titles and descriptions) to classify a product into a leaf category. In this article, we propose a new paradigm based on machine translation. In our approach, we translate a product’s natural language description into a sequence of tokens representing a root-to-leaf path in a product taxonomy. In our experiments on two large real-world datasets, we show that our approach achieves better predictive accuracy than a state-of-the-art classification system for product categorization. In addition, we demonstrate that our machine translation models can propose meaningful new paths between previously unconnected nodes in a taxonomy tree, thereby transforming the taxonomy into a directed acyclic graph. We discuss how the resultant taxonomy directed acyclic graph promotes user-friendly navigation, and how it is more adaptable to new products.

References

[1]
Chris Anderson. 2006. The Long Tail. Hyperion.
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations.
[3]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3 (Jan. 2003), 993--1022.
[4]
Ali Cevahir and Koji Murakami. 2016. Large-scale multi-class and hierarchical product categorization for an e-commerce giant. In Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers (COLING’16). 525--535.
[5]
Jianfu Chen and David Warren. 2013. Cost-sensitive learning for large-scale hierarchical classification. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management. 1351--1360.
[6]
Kyunghyun Cho, Bart Van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014a. On the properties of neural machine translation: Encoder-decoder approaches. arXiv:1409.1259.
[7]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014b. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv:1406.1078.
[8]
Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning 20, 3 (1995), 273--297.
[9]
Thomas Cover and Peter Hart. 1967. Nearest neighbor pattern classification. IEEE Transactions on Information Theory 13, 1 (1967), 21--27.
[10]
Pradipto Das, Yandi Xia, Aaron Levine, Giuseppe Di Fabbrizio, and Ankur Datta. 2016. Large-scale taxonomy categorization for noisy product listings. In Proceedings of the 2016 IEEE International Conference on Big Data (Big Data’16). IEEE, Los Alamitos, CA, 3885--3894.
[11]
Jerome H. Friedman. 2000. Greedy function approximation: A gradient boosting machine. Annals of Statistics 29 (2000), 1189--1232.
[12]
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press. http://www.deeplearningbook.org.
[13]
Jung-Woo Ha, Hyuna Pyo, and Jeonghee Kim. 2016. Large-scale item categorization in e-commerce using multiple recurrent neural networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 107--115.
[14]
Hany Hassan, Anthony Aue, Chang Chen, Vishal Chowdhary, Jonathan Clark, Christian Federmann, Xuedong Huang, et al. 2018. Achieving human parity on automatic Chinese to English news translation. arXiv:1803.05567.
[15]
Ruining He and Julian McAuley. 2016. Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In Proceedings of the 25th International Conference on World Wide Web.
[16]
Evan Heit and Jacob Rubinstein. 1994. Similarity and property effects in inductive reasoning.Journal of Experimental Psychology. Learning, Memory, and Cognition 20 2 (1994), 411--22.
[17]
Geoffrey E. Hinton, Simon Osindero, and Yee-Whye Teh. 2006. A fast learning algorithm for deep belief nets. Neural Computation 18, 7 (2006), 1527--1554.
[18]
Geoffrey E. Hinton and Ruslan R. Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786 (2006), 504--507.
[19]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation 9 (1997), 1735--1780.
[20]
Nal Kalchbrenner and Phil Blunsom. 2013. Recurrent continuous translation models. In Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing. 1700--1709.
[21]
Philipp Koehn. 2017. Neural machine translation. arXiv:1709.07809.
[22]
Zornitsa Kozareva. 2015. Everyone likes shopping! Multi-class product categorization for e-commerce. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT’15).
[23]
Taku Kudo, Kaoru Yamamoto, and Yuji Matsumoto. 2004. Applying conditional random fields to Japanese morphological analysis. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 230--237.
[24]
Yann LeCun, Léon Bottou, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 11 (1998), 2278–2324.
[25]
Yiu-Chang Lin, Pradipto Das, and Ankur Datta. 2018. Overview of the SIGIR 2018 eCom Rakuten Data Challenge. In Proceedings of the 2018 SIGIR Workshop on eCommerce (SIGIR eCom’18).
[26]
Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1412--1421.
[27]
Julian J. McAuley, Rahul Pandey, and Jure Leskovec. 2015. Inferring networks of substitutable and complementary products. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 785--794.
[28]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Gregory S. Corrado, and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems. 3111--3119.
[29]
Brian H. Ross and Gregory L. Murphy. 1999. Food for thought: Cross-classification and category organization in a complex real-world domain. Cognitive Psychology 38, 4 (1999), 495--553.
[30]
Patrick Shafto and John D. Coley. 2003. Development of categorization and reasoning in the natural world: Novices to experts, naive similarity to ecological knowledge. Journal of Experimental Psychology: Learning, Memory, and Cognition 29 4 (2003), 641--649.
[31]
Patrick Shafto, Charles Kemp, Elizabeth Baraff, John D. Coley, and Joshua B. Tenenbaum. 2005. Context-sensitive induction. In Proceedings of the 27th Annual Conference of the Cognitive Science Society. 2003--2008.
[32]
Dan Shen, Jean-David Ruvini, and Badrul Sarwar. 2012. Large-scale item categorization for e-commerce. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 595--604.
[33]
Chong Sun, Narasimhan Rampalli, Frank Yang, and AnHai Doan. 2014. Chimera: Large-scale classification using machine learning, rules, and crowdsourcing. Proceedings of the VLDB Endowment 7 (2014), 1529--1540.
[34]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems. 3104–3112,.
[35]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Red Hook, NY, 5998--6008. http://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf.
[36]
Yandi Xia, Aaron Levine, Pradipto Das, Giuseppe Di Fabbrizio, Keiji Shinzato, and Ankur Datta. 2017. Large-scale categorization of Japanese product titles using neural attention models. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers.
[37]
Hsiang-Fu Yu, Chia-Hua Ho, Prakash Arunachalam, Manas Somaiya, and Chih-Jen Lin. 2013. Product Title Classification Versus Text Classification. Technical Report. Department of Computer Science, National Taiwan University, Taipei, Taiwan.

Cited By

View all
  • (2024)Automatic classification of products in P2P marketplacesJournal of Intelligent & Fuzzy Systems10.3233/JIFS-219344(1-11)Online publication date: 23-Mar-2024
  • (2024)TagRec++: Hierarchical Label Aware Attention Network for Question CategorizationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335450436:7(3529-3540)Online publication date: 16-Jan-2024
  • (2024)A Flat-Hierarchical Approach Based on Machine Learning Model for e-Commerce Product ClassificationIEEE Access10.1109/ACCESS.2024.340069312(72730-72745)Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Management Information Systems
ACM Transactions on Management Information Systems  Volume 11, Issue 3
Special Section on WITS 2018 and Regular Articles
September 2020
140 pages
ISSN:2158-656X
EISSN:2158-6578
DOI:10.1145/3407737
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 July 2020
Online AM: 07 May 2020
Accepted: 01 February 2020
Revised: 01 October 2019
Received: 01 May 2019
Published in TMIS Volume 11, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. E-commerce
  2. classification
  3. machine translation

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)85
  • Downloads (Last 6 weeks)7
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Automatic classification of products in P2P marketplacesJournal of Intelligent & Fuzzy Systems10.3233/JIFS-219344(1-11)Online publication date: 23-Mar-2024
  • (2024)TagRec++: Hierarchical Label Aware Attention Network for Question CategorizationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335450436:7(3529-3540)Online publication date: 16-Jan-2024
  • (2024)A Flat-Hierarchical Approach Based on Machine Learning Model for e-Commerce Product ClassificationIEEE Access10.1109/ACCESS.2024.340069312(72730-72745)Online publication date: 2024
  • (2023)Large Language Models versus Natural Language Understanding and GenerationProceedings of the 27th Pan-Hellenic Conference on Progress in Computing and Informatics10.1145/3635059.3635104(278-290)Online publication date: 24-Nov-2023
  • (2023)TME: Tree-guided Multi-task Embedding Learning towards Semantic Venue AnnotationACM Transactions on Information Systems10.1145/358255341:4(1-24)Online publication date: 8-Apr-2023
  • (2023)E-Commerce: Reach Customers and Drive Sales with Data Science and Big Data Analytics2023 2nd International Conference for Innovation in Technology (INOCON)10.1109/INOCON57975.2023.10101132(1-6)Online publication date: 3-Mar-2023
  • (2023)Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00229(2988-3002)Online publication date: Apr-2023
  • (2022)Clasificación automática de publicaciones de productos en sitio de comercio electrónico2022 IEEE Biennial Congress of Argentina (ARGENCON)10.1109/ARGENCON55245.2022.9939985(1-8)Online publication date: 7-Sep-2022
  • (2022)Explainable Automatic Industrial Carbon Footprint Estimation From Bank Transaction Classification Using Natural Language ProcessingIEEE Access10.1109/ACCESS.2022.322632410(126326-126338)Online publication date: 2022
  • (2021)Quality Analysis of Shopee Seller Portal by Using Category Recommendation System Approach2021 19th International Conference on ICT and Knowledge Engineering (ICT&KE)10.1109/ICTKE52386.2021.9665701(1-4)Online publication date: 24-Nov-2021
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media