More Web Proxy on the site http://driver.im/

research-article

Curriculum learning empowered reinforcement learning for graph-based portfolio management: : Performance optimization and comprehensive analysis

Author:

Abdullah Ali SalamaiAuthors Info & Claims

Volume 179, Issue C

https://doi.org/10.1016/j.neunet.2024.106537

Published: 21 November 2024 Publication History

Abstract

Portfolio management (PM) is a popular financial process that concerns the occasional reallocation of a particular quantity of capital into a portfolio of assets, with the main aim of maximizing profitability conditioned to a certain level of risk. Given the inherent dynamicity of stock exchanges and development for long-term performance, reinforcement learning (RL) has become a dominating solution for solving the problem of portfolio management in an automated and efficient manner. Nevertheless, the present RL-based PM methods just take into account the variations in prices of portfolio assets and the implications of price variations, while overlooking the significant relationships among different assets in the market, which are extremely valuable for managerial decisions. To close this gap, this paper introduces a novel deep model that combines two subnetworks; one to learn a temporal representation of historical prices using a refined temporal learner, while the other learns the relationships between different stocks in the market using a relation graph learner (RGL). Then, the above learners are integrated into the curriculum RL scheme for formulating the PM as a curriculum Markov Decision Process, in which an adaptive curriculum policy is presented to enable the agent to adaptively minimize risk value and maximize cumulative return. Proof-of-concept experiments are performed on data from three public stock indices (namely S&P500, NYSE, and NASDAQ), and the results demonstrate the efficiency of the proposed framework in improving the portfolio management performance over the competing RL solutions.

References

[1]

A.M. Aboussalah, C.G. Lee, Continuous control with Stacked Deep Dynamic Recurrent Reinforcement Learning for portfolio optimization, Expert Syst. Appl. (2020),.

Digital Library

[2]

Alam, R.; Arnob, R.I.; Alam, A.E., “An arima-lstm correlation coefficient based hybrid model for portfolio management of dhaka stock exchange,” 2020, doi: 10.1007/978-3-030-60036-5_15.

[3]

Q. Cai, C. Cui, Y. Xiong, W. Wang, Z. Xie, M. Zhang, A Survey on Deep Reinforcement Learning for Data Processing and Analytics, IEEE Trans. Knowl. Data Eng. (2022),.

Digital Library

[4]

Cao, H.K.; Cao, H.K.; Nguyen, B.T., “DELAFO: An Efficient Portfolio Optimization Using Deep Neural Networks,” 2020, doi: 10.1007/978-3-030-47426-3_48.

[5]

A. Charpentier, R. Élie, C. Remlinger, Reinforcement Learning in Economics and Finance, Comput. Econ. (2021),.

Digital Library

[6]

W. Chen, H. Zhang, M.K. Mehlawat, L. Jia, Mean–variance portfolio optimization using machine learning-based stock price prediction, Appl. Soft Comput. (2021),.

[7]

W. Chen, M. Jiang, W.G. Zhang, Z. Chen, A novel graph convolutional feature based convolutional neural network for stock trend prediction, Inf. Sci. (Ny). (2021),.

[8]

D. Cheng, F. Yang, S. Xiang, J. Liu, Financial time series forecasting with multi-modality graph neural network, Pattern Recognit 121 (2022),. Jan.

Digital Library

[9]

T.M. Cover, E. Ordentlich, Universal Portfolios with Side Information, IEEE Trans. Inf. Theory (1996),.

Digital Library

[10]

F. Feng, X. He, X. Wang, C. Luo, Y. Liu, T.S. Chua, Temporal relational ranking for stock prediction, ACM Trans. Inf. Syst. (2019),.

Digital Library

[11]

E. Hillebrand, A mean-reversion theory of stock-market crashes, 10th Annu. Meet. Ger. Financ. Assoc. (2003).

[12]

Hu, Z.; Dong, Y.; Wang, K.; Sun, Y., “Heterogeneous Graph Transformer,” 2020, doi: 10.1145/3366423.3380027.

[13]

W.C. Huang, C.T. Chen, C. Lee, F.H. Kuo, S.H. Huang, Attentive gated graph sequence neural network-based time-series information fusion for financial trading, Inf. Fusion 91 (2023) 261–276,. Mar.

Digital Library

[14]

K. Imajo, K. Minami, K. Ito, and K. Nakagawa, “Deep Portfolio Optimization via Distributional Prediction of Residual Factors,” Proc. AAAI Conf. Artif. Intell., vol. 35, no. 1, pp. 213–222, May 2021, doi: 10.1609/aaai.v35i1.16095.

[15]

G. Jeong, H.Y. Kim, Improving financial trading decisions using deep Q-learning: Predicting the number of shares, action strategies, and transfer learning, Expert Syst. Appl. 117 (2019) 125–138,. Mar.

[16]

Joshi, D.J.; Kale, I.; Gandewar, S.; Korate, O.; Patwari, D.; Patil, S., “Reinforcement Learning: A Survey,” 2021, doi: 10.1007/978-981-33-4859-2_29.

[17]

Kim, R.; So, C.H.; Jeong, M.; Lee, S.; Kim, J.; Kang, J., “HATS: A Hierarchical Graph Attention Network for Stock Movement Prediction,” Aug 2019, [Online]. Available: https://arxiv.org/abs/1908.07999.

[18]

H. Kim, Mean-variance portfolio optimization with stock return prediction using xgboost, Econ. Comput. Econ. Cybern. Stud. Res. (2021),.

[19]

Y. Kwak, J. Song, H. Lee, Neural network with fixed noise for index-tracking portfolio optimization, Expert Syst. Appl. (2021),.

Digital Library

[20]

B. Li, S.C.H. Hoi, V. Gopalkrishnan, CORN: Correlation-driven nonparametric learning approach for portfolio selection, ACM Trans. Intell. Syst. Technol. (2011),.

Digital Library

[21]

J. Li, Y. Zhang, X. Yang, L. Chen, Online portfolio management via deep reinforcement learning with high-frequency data, Inf. Process. Manag. 60 (3) (2023),. May.

Digital Library

[22]

X.Y. Liu, H. Yang, J. Gao, C. Wang, FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance, SSRN Electron. J. (2021),.

[23]

Y. Ma, R. Han, W. Wang, Prediction-Based Portfolio Optimization Models Using Deep Neural Networks, IEEE Access (2020),.

[24]

Y. Ma, R. Han, W. Wang, Portfolio optimization with return prediction using deep learning and machine learning, Expert Syst. Appl. (2021),.

[25]

L. Malandri, F.Z. Xing, C. Orsenigo, C. Vercellis, E. Cambria, Public Mood–Driven Asset Allocation: the Importance of Financial Sentiment in Portfolio Management, Cognit. Comput. (2018),.

[26]

Narvekar, S.; Stone, P., “Learning curriculum policies for reinforcement learning,” 2019.

[27]

Niu, H.; Li, S.; Li, J., “MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization,” in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Oct 2022, pp. 1573–1583, doi: 10.1145/3511808.3557363.

[28]

Sawhney, R.; Agarwal, S.; Wadhwa, A.; Shah, R., “Exploring the scale-free nature of stock markets: Hyperbolic graph learning for algorithmic trading,” 2021, doi: 10.1145/3442381.3450095.

[29]

Schlichtkrull, M.; Kipf, T.N.; Bloem, P.; van den Berg, R.; Titov, I.; Welling, M., “Modeling Relational Data with Graph Convolutional Networks,” 2018, doi: 10.1007/978-3-319-93417-4_38.

[30]

S. Shi, J. Li, G. Li, P. Pan, Q. Chen, Q. Sun, GPM: A graph convolutional network based reinforcement learning framework for portfolio management, Neurocomputing 498 (2022) 14–27,. Aug.

Digital Library

[31]

F. Soleymani, E. Paquet, Financial portfolio optimization with online deep reinforcement learning and restricted stacked autoencoder—DeepBreath, Expert Syst. Appl. (2020),.

[32]

F. Soleymani, E. Paquet, Deep graph convolutional reinforcement learning for financial portfolio management – DeepPocket, Expert Syst. Appl. (2021),.

Digital Library

[33]

X. Song, et al., Volatility analysis with realized GARCH-Itô models, J. Econom. (2021),.

[34]

C. Song, F. Cai, M. Wang, J. Zheng, T. Shao, TaxonPrompt: Taxonomy-aware curriculum prompt learning for few-shot event classification, Knowledge-Based Syst. (2023),. Jan.

Digital Library

[35]

Sun, S.; et al., “DeepScalper,” in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Oct 2022, pp. 1858–1867, doi: 10.1145/3511808.3557283.

[36]

Sun, S.; et al., “DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities,” in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Oct 2022, pp. 1858–1867, doi: 10.1145/3511808.3557283.

[37]

A. Thakkar, K. Chaudhari, Fusion in stock market prediction: A decade survey on the necessity, recent developments, and potential future directions, Inf. Fusion (2021),.

[38]

Vakil, N.; Amiri, H., “Generic and Trend-aware Curriculum Learning for Relation Extraction in Graph Neural Networks,” NAACL 2022 - 2022 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. Proc. Conf., pp. 2202–2213, 2022, doi: https://doi.org/10.48550/arXiv.2205.08625.

[39]

Vaswani, A.; et al., “Attention is all you need,” 2017.

[40]

Wang, R.; Wei, H.; An, B.; Feng, Z.; Yao, J., “Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management,” Proc. AAAI Conf. Artif. Intell., vol. 35, no. 1, pp. 626–633, May 2021, doi: 10.1609/aaai.v35i1.16142.

[41]

Z. Wang, B. Huang, S. Tu, K. Zhang, and L. Xu, “DeepTrader: A Deep Reinforcement Learning Approach for Risk-Return Balanced Portfolio Management with Market Conditions Embedding,” Proc. AAAI Conf. Artif. Intell., vol. 35, no. 1, pp. 643–650, May 2021, doi: 10.1609/aaai.v35i1.16144.

[42]

C. Wang, Different GARCH model analysis on returns and volatility in Bitcoin, Data Sci. Financ. Econ. (2021),.

[43]

Wen, Q.; et al., “Transformers in Time Series: A Survey,” Feb 2022, [Online]. Available: https://arxiv.org/abs/2202.07125.

[44]

Woo, G.; Liu, C.; Sahoo, D.; Kumar, A.; Hoi, S., “ETSformer: Exponential Smoothing Transformers for Time-series Forecasting,” Feb 2022, [Online]. Available: https://arxiv.org/abs/2202.01381.

[45]

Wu, X.; Jiang, M.; Liu, G., “R-GSN: The Relation-based Graph Similar Network for Heterogeneous Graph,” Mar 2021, [Online]. Available: https://arxiv.org/abs/2103.07877.

[46]

Xiang, S.; Cheng, D.; Shang, C.; Zhang, Y.; Liang, Y., “Temporal and Heterogeneous Graph Neural Network for Financial Time Series Prediction,” in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Oct 2022, pp. 3584–3593, doi: 10.1145/3511808.3557089.

[47]

Yang, C.; Wang, R.; Yao, S.; Liu, S.; Abdelzaher, T., “Revisiting Over-smoothing in Deep GCNs,” Mar 2020, [Online]. Available: https://arxiv.org/abs/2003.13663.

[48]

Y. Yang, Z. Guan, J. Li, W. Zhao, J. Cui, Q. Wang, Interpretable and Efficient Heterogeneous Graph Convolutional Network, IEEE Trans. Knowl. Data Eng. (2021),.

[49]

X. Yang, T. Burghardt, M. Mirmehdi, Dynamic Curriculum Learning for Great Ape Detection in the Wild, Int. J. Comput. Vis. (2023),. Jan.

Digital Library

[50]

Ye, Y.; et al., “Reinforcement-learning based portfolio management with augmented asset movement prediction states,” 2020, doi: 10.1609/aaai.v34i01.5462.

[51]

T. Yin, C. Liu, F. Ding, Z. Feng, B. Yuan, N. Zhang, Graph-based stock correlation and prediction for high-frequency trading systems, Pattern Recognit 122 (2022),. Feb.

Digital Library

[52]

H. Yun, M. Lee, Y.S. Kang, J. Seok, Portfolio management via two-stage deep learning with a joint cost, Expert Syst. Appl. (2020),.

Digital Library

[53]

A. Zeng, M. Chen, L. Zhang, and Q. Xu, “Are Transformers Effective for Time Series Forecasting?,” Proc. AAAI Conf. Artif. Intell., vol. 37, no. 9, pp. 11121–11128, Jun 2023, doi: 10.1609/aaai.v37i9.26317.

[54]

Z. Zhang, S. Zohren, S. Roberts, Deep Learning for Portfolio Optimisation, SSRN Electron. J. (2020),.

Index Terms

Curriculum learning empowered reinforcement learning for graph-based portfolio management: Performance optimization and comprehensive analysis

Index terms have been assigned to the content through auto-classification.

Recommendations

Portfolio dynamic trading strategies using deep reinforcement learning
Abstract
Using the constituent stocks of the iShares MSCI US ESG Select Index ETF, a matrix of technical indicators, returns, and covariance is incorporated to represent the inherent information characteristics of the stock market. In this study, based on ...
Multi-agent reinforcement learning approach for hedging portfolio problem
Abstract
Developing a hedging strategy to reduce risk of losses for a given set of stocks in a portfolio is a difficult task due to cost of the hedge. In Vietnam stock market, cross-hedge is involved hedging a long position of a stock because there is no ...
Portfolio management algorithm based on long-term prediction of assets
ICBDT '22: Proceedings of the 5th International Conference on Big Data Technologies

Sequence data prediction is widely used in many fields. One of the most typical applications is in the financial fields, e.g., it can be used to predict the prices of assets. In this paper, we propose a new portfolio management algorithm based on long-...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Neural Networks

Neural Networks Volume 179, Issue C

Nov 2024

1557 pages

Issue’s Table of Contents

Elsevier Ltd.

Publisher

Elsevier Science Ltd.

United Kingdom

Publication History

Published: 21 November 2024

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 14 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents