Abstract
In this paper, a novel multi-kernel support vector machine (MKSVM) combining global and local characteristics of the input data is proposed. Along with, a parameter tuning approach is developed using the fruit fly optimization (FFO), which is applied to stock market movement direction prediction problem. At first, factor analysis is used for identifying reduced key features called as factor scores from the raw stock index data which when applied to the model contributes to improvement in prediction performance. Subsequently, a hybrid kernel method combining local and global characteristics of input data is proposed, where polynomial is used for global kernel and radial basis function is utilized for local kernel. Additionally, FFO-based parameter tuning scheme is proposed to enhance the prediction performance further. Lastly, the evolving MKSVM with best feature subset and optimal parameters is used to predict stock market movement direction based upon historical data series. For evaluation and illustration purposes, three significant stock databases, NYSE, DJI and S&P 500 are used as testing targets. The effectiveness of this proposed approach is proved by three different stock market datasets, which demonstrate that the proposed approach outperforms the MKSVM with default parameter, MKSVM with PSO, MKSVM with GA and other methods. In addition, our findings reveal that the optimization strategy proposed here may be used as a promising choice forecasting tool for better generalization ability higher forecasting accuracy.
Similar content being viewed by others
References
Guo Y, Han S, Shen C, Li Y, Yin X, Bai Y (2018) An adaptive SVR for high-frequency stock price forecasting. IEEE Access 6:11397–11404
Wang W (2018) A big data framework for stock price forecasting using fuzzy time series. Multimed Tools Appl 77(8):10123–10134
Nahil A, Lyhyaoui A (2018) Short-term stock price forecasting using kernel principal component analysis and support vector machines: the case of Casablanca stock exchange. Procedia Comput Sci 127:61–169
Chen Y, Hao Y (2017) A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Syst Appl 80:340–355
Zahedi J, Rounaghi MM (2015) Application of artificial neural network models and principal component analysis method in predicting stock prices on Tehran Stock Exchange. Phys A 438:178–187
Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Addison-Wesley, New York
Gowthul Alam MM, Baulkani S (2017) Reformulated query-based document retrieval using optimised kernel fuzzy clustering algorithm. Int J Bus Intell Data Min 12(3):299–318
Gowthul Alam MM, Baulkani S (2016) A hybrid approach for web document clustering using K-means and artificial bee colony algorithm. Int J Intell Eng Syst 9(4):11–20
Kennedy J (2007) The particle swarm as collaborative sampling of the search space. Adv Complex Syst 10:191–213
Wu J, Jin L, Liu M (2010) A hybrid support vector regression approach for rainfall forecasting using particle swarm optimization and projection pursuit technology. Int J Comput Intell Appl 9:87–104
Sundararaj V, Muthukumar S, Kumar RS (2018) An optimal cluster formation based energy efficient dynamic scheduling hybrid MAC protocol for heavy traffic load in wireless sensor network. Comput Secur 77:277–288
Bäck T, Hammel U, Schwefel HP (1997) Evolutionary computation: comments on the history and current state. IEEE Trans Evol Comput 1:3–17
Koziel S (1999) Evolutionary algorithms, homomorphous mappings, and constrained parameter optimization. Evol Comput 7:19–44
Pan WT (2012) A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowl Syst 26:69–74
Lin F-L, Yang S-Y, Marsh T, Chen Y-F (2018) Stock and bond return relations and stock market uncertainty: evidence from wavelet analysis. Int Rev Econ Finance 55:285–294
Wang H-F, Kuo CY (2005) Factor analysis in data mining. Comput Math Appl 48:1765–1778
Hsia T-C, Hsu Y-L, Jen H-L (2009) A factor analysis based selection process for predicting successful university color guard club members. Expert Syst Appl 36:2719–2726
Kim H, Soibelman L, Grobler F (2008) Factor selection for delay analysis using knowledge discovery in databases. Autom Constr 17:550–560
Anish CM, Majhi B, Tonde HS (2014) A novel hybrid nonlinear adaptive model for prediction of stock indices. In: Proceedings of international conference on communication and computing, ICC-2014-Bangalore, pp 18–25
Chen AS, Leung MT, Daouk H (2003) Application of neural networks to an emerging financial market: forecasting and trading the Taiwan Stock Index. Comput Oper Res 30(6):901–923
Diler AI (2003) Predicting direction of ISE national-100 index with back propagation trained neural network. J Istanb Stock Exch 7(25–26):65–81
Altay E, Satman MH (2005) Stock market forecasting: artificial neural networks and linear regression comparison in an emerging market. J Financ Manag Anal 18(2):18–33
Cao Q, Leggio KB, Schniederjans MJA (2005) A comparison between Fama and French’s model and artificial neural networks in predicting the Chinese stock market. Comput Oper Res 32:2499–2512
Vapnik V (2000) The nature of statistical learning theory. Springer, Berlin, pp 863–884
Lins ID, Moura MDC, Zio E, Droguett EL (2012) A particle swarm-optimized support vector machine for reliability prediction. Qual Reliab Eng Int 28(2):141–158
Hu Y, Wu C, Liu H (2011) Prediction of passenger flow on the highway based on the least square support vector machine. Transport 26(2):197–203
Kim K (2003) Financial time series forecasting using support vector machines. Neuro Comput 55:307–319
Huang W, Nakamori Y, Wang SY (2005) Forecasting stock market movement direction with support vector machine. Comput Oper Res 32:2513–2522
Nahil A, Lyhyaoui A (2018) Short-term stock price forecasting using kernel principal component analysis and support vector machines: the case of Casablanca stock exchange. Procedia Comput Sci 127(2018):161–169
Hsu SH, Hsieh JJPA, Chih TC, Hsu KC (2009) A two-stage architecture for stock price forecasting by integrating self-organizing map and support vector regression. Expert Syst Appl 36(4):7947–7951
Li H, Guo S, Zhao H, Su C, Wang B (2012) Annual electric load forecasting by a least squares support vector machine with a fruit fly optimization algorithm. Energies 5(11):4430–4445
Chen P-W, Lin W-Y, Huang T-H, Pan W-T (2013) Using fruit fly optimization algorithm optimized grey model neural network to perform satisfaction analysis for e-business service. Appl Math Inf Sci 7(2):459–465
Li H-Z, Guo S, Li C-J, Sun J-Q (2013) A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowl Syst 37:378–387
Shan D, Cao G, Dong H (2013) LGMS-FOA: an improved fruit fly optimization algorithm for solving optimization problems. Math Probl Eng 9
Wang H-F, Kuo CY (2005) Factor analysis in data mining. Comput Math Appl 48:1765–1778
Sharma S, Kumar A (2006) Cluster analysis and factor analysis (chapter 18). Sage Publication, Thousand Oaks
Chen Y-S, Cheng C-H, Chiu C-L, Huang S-T (2016) A study of ANFIS-based multi-factor time series models for forecasting stock index. Appl Intell 45(2):277–292
Jenkins G, Doney (2008) Principal component and factor analysis. In: Modeling methods for marine science, pp 81–117
Pan WT (2012) A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowl Syst 26:69–74
Kennedy J, Eberhart R (2001) Swarm intelligence. Morgan Kaufmann publishers Inc, San Francisco
Chen HL, Yang B, Liu J, Liu D-Y (2011) A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl 38(7):9014–9022
Yuan X, Tan Q, Lei X, Yuan Y, Wu X (2017) Wind power prediction using hybrid autoregressive fractionally integrated moving average and least square support vector machine. Energy 129:122–137
Maldonado S, Pérez J, Bravo C (2017) Cost-based feature selection for support vector machines: an application in credit scoring. Eur J Oper Res 261(2):656–665
Mehmet G, Ethem A (2010) Cost-conscious multiple kernel learning. Pattern Recognit Lett 31:959–965
Yuan SF, Chu FL (2007) Fault diagnosis based on support vector machines with parameter optimization by artificial immunization algorithm. Mech Syst Signal Process 21:1318–1330
Smits GF, Jordaan EM (2002) Improved SVM regression using mixtures of kernels. In: Proceedings of the 2002 international joint conference on neural networks, vol 3. IEEE, Honolulu, Hi, USA, pp 2785–2790
Chen L, Chen CLP, Lu M (2011) A multiple-kernel fuzzy c-means algorithm for image segmentation. IEEE Trans Syst Man Cybern 41(5):1263–1274
Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688
Chen Y, Hao Y (2017) A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Syst Appl 80:340–355
Javedani Sadaei H, Lee MH (2014) Multilayer stock forecasting model using fuzzy time series. Sci World J 2014:3
Cheng C-H, Chen T-L, Teoh HJ, Chiang CH (2008) Fuzzy time-series based on adaptive expectation model for TAIEX forecasting. Exp Syst Appl 34(2):1126–1132
Chen SM, Chen CD (2011) Handling forecasting problems based on high-order fuzzy logical relationships. Expert Syst Appl 38(4):3857–3864
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gowthul Alam, M.M., Baulkani, S. Local and global characteristics-based kernel hybridization to increase optimal support vector machine performance for stock market prediction. Knowl Inf Syst 60, 971–1000 (2019). https://doi.org/10.1007/s10115-018-1263-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-018-1263-1