
Logistic Regression and Boosting for Labeled Bags of Instances

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2004)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3056)


Abstract

In this paper we upgrade linear logistic regression and boosting to multi-instance data, where each example consists of a labeled bag of instances. This is done by connecting predictions for individual instances to a bag-level probability estimate by simple averaging and maximizing the likelihood at the bag level—in other words, by assuming that all instances contribute equally and independently to a bag’s label. We present empirical results for artificial data generated according to the underlying generative model that we assume, and also show that the two algorithms produce competitive results on the Musk benchmark datasets.
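The averaging idea in the abstract can be illustrated with a minimal sketch. It assumes that "simple averaging" means the arithmetic mean of instance-level logistic probabilities, which is one possible reading of the abstract. The function names (`bag_probability`, `bag_neg_log_likelihood`, `fit`), the toy data, and the use of plain gradient descent are illustrative assumptions, not the authors' implementation; the abstract does not state which optimizer is used.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bag_probability(X_bag, w, b):
    """Bag-level estimate: mean of the instance-level logistic probabilities."""
    return float(np.mean(sigmoid(X_bag @ w + b)))

def bag_neg_log_likelihood(bags, labels, w, b, eps=1e-12):
    """Negative Bernoulli log-likelihood computed per bag, not per instance."""
    total = 0.0
    for X_bag, y in zip(bags, labels):
        p = np.clip(bag_probability(X_bag, w, b), eps, 1.0 - eps)
        total -= y * np.log(p) + (1 - y) * np.log(1.0 - p)
    return total

def fit(bags, labels, lr=0.1, steps=500, eps=1e-12):
    """Minimise the bag-level loss with plain gradient descent (an assumption)."""
    d = bags[0].shape[1]
    w, b = np.zeros(d), 0.0
    for _ in range(steps):
        grad_w, grad_b = np.zeros(d), 0.0
        for X_bag, y in zip(bags, labels):
            s = sigmoid(X_bag @ w + b)            # instance-level probabilities
            p = np.clip(s.mean(), eps, 1.0 - eps)  # bag-level probability
            dloss_dp = (p - y) / (p * (1.0 - p))   # derivative of the bag loss w.r.t. p
            dp_dz = s * (1.0 - s) / len(s)         # derivative of the mean w.r.t. each logit
            grad_w += dloss_dp * (dp_dz @ X_bag)
            grad_b += dloss_dp * dp_dz.sum()
        w -= lr * grad_w / len(bags)
        b -= lr * grad_b / len(bags)
    return w, b

# Hypothetical toy data: two positive and two negative bags of 1-D instances.
toy_bags = [
    np.array([[1.5], [2.0], [2.5]]),
    np.array([[1.0], [3.0]]),
    np.array([[-2.0], [-1.5], [-2.5]]),
    np.array([[-1.0], [-3.0]]),
]
toy_labels = [1, 1, 0, 0]

w_fit, b_fit = fit(toy_bags, toy_labels)
print(bag_probability(toy_bags[0], w_fit, b_fit))
print(bag_probability(toy_bags[2], w_fit, b_fit))
```

Because the loss is defined on the averaged probability, every instance in a bag receives the same weight in the gradient, which mirrors the assumption that all instances contribute equally to the bag's label.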




Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, X., Frank, E. (2004). Logistic Regression and Boosting for Labeled Bags of Instances. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science, vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_35


  • DOI: https://doi.org/10.1007/978-3-540-24775-3_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22064-0

  • Online ISBN: 978-3-540-24775-3

  • eBook Packages: Springer Book Archive
