More Web Proxy on the site http://driver.im/

research-article

Leveraging Rough Set Theory to Enhance the Performance of Financial Statement Fraud Detection Model

Authors: Chengzhi Niu, Yougan Zhu, Hong He, Boyang Li, Yijia RenAuthors Info & Claims

CAIBDA '24: Proceedings of the 2024 4th International Conference on Artificial Intelligence, Big Data and Algorithms

Pages 153 - 160

https://doi.org/10.1145/3690407.3690433

Published: 24 October 2024 Publication History

Abstract

In this study, we have developed a hybrid financial statement fraud detection model by combining rough set theory and ensemble learning. In this research, we have developed a pre-processing filter utilizing rough set theory to assist the model in selecting the most appropriate features. Additionally, we have also incorporated a feature extracted from the text information of the Management Discussion and Analysis (MD&A) section in the financial report, which effectively captures the positive tone expressed by the management. To address the challenge of imbalanced classes in fraud detection, we have applied the Synthetic Minority Oversampling Technique (SMOTE) algorithm. The optimization procedures significantly improved the performance of our model. We have compared our model to other regular machine learning methods and observed its superiority. The results demonstrate that the integration of rough set theory attribute reduction algorithms has substantially enhanced the performance of the ensemble learning model. Furthermore, the inclusion of the feature extracted from text information has proven to be effective in detecting financial fraud behaviors.

References

[1]

Baesens, B., S. Hoppner, and T. Verdonck, 2021 Data engineering for fraud detection. Decision Support Systems. 150: p. 13.

Digital Library

[2]

Hu, N., L. Liu, and V. Sambamurthy, 2011 Fraud detection in online consumer reviews. Decision Support Systems. 50(3): p. 614-626.

Digital Library

[3]

Pourhabibi, T., et al., 2020 Fraud detection: A systematic literature review of graph-based anomaly detection approaches. Decision Support Systems. 133: p. 15.

Digital Library

[4]

Dilla, W.N. and R.L. Raschke, 2015 Data visualization for fraud detection: Practice implications and a call for future research. International Journal of Accounting Information Systems. 16: p. 1-22.

[5]

Papik, M. and L. Papikova, 2022 Detecting accounting fraud in companies reporting under US GAAP through data mining. International Journal of Accounting Information Systems. 45: p. 19.

[6]

van Capelleveen, G., et al., 2016 Outlier detection in healthcare fraud: A case study in the Medicaid dental domain. International Journal of Accounting Information Systems. 21: p. 18-31.

[7]

Choi, D. and K. Lee, 2018 An Artificial Intelligence Approach to Financial Fraud Detection under IoT Environment: A Survey and Implementation. Security and Communication Networks.

[8]

Omidi, M., et al., 2019 The Efficacy of Predictive Methods in Financial Statement Fraud. Discrete Dynamics in Nature and Society. 2019.

[9]

Ravisankar, P., et al., 2011 Detection of financial statement fraud and feature selection using data mining techniques. Decision Support Systems. 50(2): p. 491-500.

Digital Library

[10]

Whiting, D.G., et al., 2012 MACHINE LEARNING METHODS FOR DETECTING PATTERNS OF MANAGEMENT FRAUD. Computational Intelligence. 28(4): p. 505-527.

Digital Library

[11]

Chen, S., 2016 Detection of fraudulent financial statements using the hybrid data mining approach. SpringerPlus. 5(1): p. 89.

[12]

Hajek, P. and R. Henriques, 2017 Mining corporate annual reports for intelligent detection of financial statement fraud – A comparative study of machine learning methods. Knowledge-Based Systems. 128: p. 139-152.

Digital Library

[13]

Bao, Y., et al., 2020 Detecting Accounting Fraud in Publicly Traded U.S. Firms Using a Machine Learning Approach. Journal of Accounting Research. 58(1): p. 199-235.

[14]

Hajek, P. and R. Henriques, 2017 Mining corporate annual reports for intelligent detection of financial statement fraud - A comparative study of machine learning methods. Knowledge-Based Systems. 128: p. 139-152.

Digital Library

[15]

Perols, J.L., et al., 2017 Finding Needles in a Haystack: Using Data Analytics to Improve Fraud Prediction. Accounting Review. 92(2): p. 221-245.

[16]

Kim, Y.J., B. Baik, and S. Cho, 2016 Detecting financial misstatements with fraud intention using multi-class cost-sensitive learning. Expert Systems with Applications. 62: p. 32-43.

Digital Library

[17]

Zhou, W. and G. Kapoor, 2011 Detecting evolutionary financial statement fraud. Decision Support Systems. 50(3): p. 570-575.

Digital Library

[18]

Chawla, N.V., et al., 2002 SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 16: p. 321-357.

[19]

Pawlak, Z., 1982 Rough sets. International Journal of Computer & Information Sciences. 11(5): p. 341-356.

[20]

Chouchoulas, A. and Q. Shen, 2001 Rough set-aided keyword reduction for text categorization. Applied Artificial Intelligence. 15(9): p. 843-873.

[21]

Shen, Q. and A. Chouchoulas, 1999 Combining rough sets and data-driven fuzzy learning for generation of classification rules. Pattern Recognition. 32(12): p. 2073-2076.

[22]

BAO, Y., et al., 2020 Detecting Accounting Fraud in Publicly Traded U.S. Firms Using a Machine Learning Approach. Journal of Accounting Research. 58(1): p. 199-235.

[23]

Ngai, E.W.T., et al., 2011 The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems. 50(3): p. 559-569.

Digital Library

[24]

Chen, Y.S. and Z.J. Wu, 2023 Financial Fraud Detection of Listed Companies in China: A Machine Learning Approach. Sustainability. 15(1).

Index Terms

Leveraging Rough Set Theory to Enhance the Performance of Financial Statement Fraud Detection Model
1. General and reference
  1. Document types
    1. Reference works

Recommendations

Rough Mereology: A Rough Set Paradigm for Unifying Rough Set Theory and Fuzzy Set Theory

In this work, we would like to discuss rough inclusions defined in Rough Mereology - a paradigm for approximate reasoning introduced by Polkowski and Skowron [20] - as a basis for common models for rough as well as fuzzy set theories. We would like to ...
Comparative study of variable precision rough set model and graded rough set model

The variable precision rough set model and graded rough set model are 2 important extended rough set models. This paper aims to make a comparative study of the 2 models. Rough set regions, primitive notions, are proposed first for the 2 models, which ...
A rough set paradigm for unifying rough set theory and fuzzy set theory
RSFDGrC'03: Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing

In this plenary address, we would like to discuss rough inclusions defined in Rough Mereology, a joint idea with A. Skowron, as a basis for common models for rough as well as fuzzy set theories. We would like to justify the point of view that tolerance (...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

CAIBDA '24: Proceedings of the 2024 4th International Conference on Artificial Intelligence, Big Data and Algorithms

June 2024

1206 pages

ISBN:9798400710247

DOI:10.1145/3690407

Copyright © 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2024

Check for updates

Author Tags

Qualifiers

Research-article

Conference

CAIBDA 2024

CAIBDA 2024: 2024 4th International Conference on Artificial Intelligence, Big Data and Algorithms

June 21 - 23, 2024

Zhengzhou, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
14
Total Downloads

Downloads (Last 12 months)14
Downloads (Last 6 weeks)14

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View full text|Download PDF

View Table of Contents