Abstract
In this paper we upgrade linear logistic regression and boosting to multi-instance data, where each example consists of a labeled bag of instances. This is done by connecting predictions for individual instances to a bag-level probability estimate by simple averaging and maximizing the likelihood at the bag level—in other words, by assuming that all instances contribute equally and independently to a bag’s label. We present empirical results for artificial data generated according to the underlying generative model that we assume, and also show that the two algorithms produce competitive results on the Musk benchmark datasets.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dietterich, T.G., Lathrop, R.H., Lozano-Pérez, T.: Solving the multiple-instance problem with the axis-parallel rectangles. Artificial Intelligence 89(1-2), 31–71 (1997)
Maron, O.: Learning from Ambiguity. PhD thesis, MIT, United States (1998)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: Proc of the 13th Int Conf. on Machine Learning, pp. 148–156. Morgan Kaufmann, San Francisco (1996)
Gill, P.E., Murray, W., Wright, M.H.: Practical Optimization. Academic Press, London (1981)
Zhou, Z.-H., Zhang, M.-L.: Ensembles of multi-instance learners. In: Proc of the 14th European Conf on Machine Learning, pp. 492–501. Springer, Heidelberg (2003)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression, a statistical view of boosting (with discussion). Annals of Statistics 28, 307–337 (2000)
Gärtner, T., Flach, P.A., Kowalczyk, A., Smola, A.J.: Multi-instance kernels. In: Proc of the 19th Int Conf. on Machine Learning, pp. 179–186. Morgan Kaufmann, San Francisco (2002)
Ramon, J., De Raedt, L.: Multi instance neural networks. In: Workshop at the 17th Int Conf on Machine Learning, Attribute-Value and Relational Learning: Crossing the Boundaries (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, X., Frank, E. (2004). Logistic Regression and Boosting for Labeled Bags of Instances. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science(), vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-24775-3_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22064-0
Online ISBN: 978-3-540-24775-3
eBook Packages: Springer Book Archive