[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3459637.3482059acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Asterisk-Shaped Features for Tabular Data

Published: 30 October 2021 Publication History

Abstract

Data often accumulates in tabular format with many attribute items, and prediction using machine learning adds value to data for business. However, studies on machine learning for tabular data only input attribute values, which reduces accuracy. Therefore, we propose an inference method that inputs attribute values and values from aggregated tabular data that has varying attribute values for each attribute item. In an experiment, we compared our proposed method with AutoGluon-Tabular using AutoML benchmark datasets. Our proposed method achieved the highest accuracy for 21 out of 39 datasets.

References

[1]
Aaron van den Oord, et al. 2016. Wavenet: a generative for a raw audio. arXiv preprint arXiv: 1609.03499.
[2]
Fisher Yu and Vladlen Koltun. 2016. Multi-scale context aggregation by dilated convolutions. International Conference on Learning Representations (ICLR).
[3]
P. Werbos. 1990. Backpropagation through time: what it does and how to do it. In Proceedings of IEEE, Vol. 78, pp. 1550--1560.
[4]
Sercan O. Arik and Tomas Pfister. 2019. TabNet: Attentive interpretable tabular learning. arXiv preprint arXiv: 1908.07442.
[5]
Hadi S. Jomaa, et al. 2019. Dataset2vec: Learning dataset meta-features. arXiv preprint arXiv: 1905.11063.
[6]
Esteban Real, et al. 2020. AutoML-Zero: Evolving machine learning algorithms. International Conference on Machine Learning (PMLR).
[7]
Nick Erickson, et al. 2020. AutoGluon-Tabular: Robust and Accurate AutoML for structured data. AutoML Workshop at ICML.
[8]
Liudmila Prokhorenkova1, et al. 2018. CatBoost: unbiased boosting with categorical features. Conference on Neural Information Processing Systems (NeurIPS).
[9]
Du Tran, et al. 2015. Learning spatiotemporal features with 3D convolutional networks. International Conference on Computer Vision (ICCV), pp. 4489--4497.
[10]
Pieter Gijsbers, et al. 2019. An open source AutoML benchmark. AutoML Workshop at ICML.
[11]
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521.7553, pp. 436--444.
[12]
Karl Pearson. 1901. On lines and planes of closest fit to systems of points in space. Philosophical Magazine, Series 6, Vol. 2, No. 11, pp. 559--572.
[13]
Alex Graves, et al. 2007. Multi-dimensional recurrent neural networks. arXiv preprint arXiv: 0705.2011v1 [cs.AI].
[14]
Guolin Ke, et al. 2017. LightGBM: a highly efficient gradient boosting decision tree. Conference on Neural Information Processing Systems (NeurIPS).

Index Terms

  1. Asterisk-Shaped Features for Tabular Data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
    October 2021
    4966 pages
    ISBN:9781450384469
    DOI:10.1145/3459637
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. asterisk-shaped features
    2. high dimensional data
    3. tabular data

    Qualifiers

    • Short-paper

    Conference

    CIKM '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 94
      Total Downloads
    • Downloads (Last 12 months)10
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media