Abstract
The use of ensemble models in many problem domains has increased significantly in the last few years. Ensemble modeling, in particular boosting, has shown great promise in improving the predictive performance of a model. Combining the ensemble members is normally done in a co-operative fashion, where each ensemble member performs the same task and their predictions are aggregated to obtain improved performance. However, it is also possible to combine the ensemble members in a competitive fashion, where the best prediction of a relevant ensemble member is selected for a particular input. This option has previously been somewhat overlooked. The aim of this article is to investigate and compare the competitive and co-operative approaches to combining the models in an ensemble. A comparison is made between a competitive ensemble model and MARS with bagging, mixture of experts, hierarchical mixture of experts, and a neural network ensemble over several public-domain regression problems that have a high degree of nonlinearity and noise. The empirical results show a substantial advantage of competitive learning over co-operative learning for all the regression problems investigated. The requirements for creating efficient ensembles and the available guidelines are also discussed.
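To make the distinction concrete, here is a minimal sketch (Python with NumPy) of the two combination schemes on a toy regression task. It is an illustration only, not the ensemble construction used in the paper: the member models, the hand-coded gate, and the synthetic data are all assumptions chosen for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: a noisy target with different behaviour on each half of the input range.
x = rng.uniform(-3.0, 3.0, size=500)
y = np.where(x < 0, np.sin(3.0 * x), 0.5 * x ** 2) + rng.normal(0.0, 0.1, size=x.shape)

# Two deliberately specialised "ensemble members": polynomial fits,
# each trained on one region of the input space (illustrative assumption).
left = np.polynomial.Polynomial.fit(x[x < 0], y[x < 0], deg=4)
right = np.polynomial.Polynomial.fit(x[x >= 0], y[x >= 0], deg=2)
preds = np.stack([left(x), right(x)])        # shape: (n_members, n_points)

# Co-operative combination: aggregate all members' predictions (here a plain mean).
coop = preds.mean(axis=0)

# Competitive combination: a gate selects one member per input.  The hand-coded
# gate below stands in for a learned gating model (e.g. a mixture-of-experts gate).
gate = (x >= 0).astype(int)
comp = preds[gate, np.arange(x.size)]

mse = lambda p: float(np.mean((p - y) ** 2))
print(f"co-operative (averaged) MSE: {mse(coop):.4f}")
print(f"competitive (gated)     MSE: {mse(comp):.4f}")
```

Because each member specialises in one region of the input space, selecting the relevant member per input (competitive) tends to give a lower error on this toy task than averaging all members (co-operative); the contrast the paper investigates is this same trade-off on real, noisy regression benchmarks.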
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Frayman, Y., Rolfe, B.F., Webb, G.I. (2002). Solving Regression Problems Using Competitive Ensemble Models. In: McKay, B., Slaney, J. (eds) AI 2002: Advances in Artificial Intelligence. AI 2002. Lecture Notes in Computer Science, vol 2557. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36187-1_45
DOI: https://doi.org/10.1007/3-540-36187-1_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00197-3
Online ISBN: 978-3-540-36187-9
eBook Packages: Springer Book Archive