[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/648036.744400guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Inferring Demographic Attributes of Anonymus Internet Users

Published: 15 August 1999 Publication History

Abstract

Today it is quite common for web page content to include an advertisement. Since advertisers often want to target their message to people with certain demographic attributes, the anonymity of Internet users poses a special problem for them. The purpose of the present research is to find an effective way to infer demographic information (e.g. gender, age or income) about people who use the Internet but for whom demographic information is not otherwise available. Our hope is to build a high quality database of demographic profiles covering a large segment of the Internet population without having to survey each individual Internet user. Though Internet users are largely anonymous, they nonetheless provide a certain amount of usage information. Usage information includes, but is not limited to, (a) search terms entered by the Internet user and (b) web pages accessed by the Internet user. In this paper, we describe an application of the Latent Semantic Analysis (LSA) [1] information retrieval technique to construct a vector space in which we can represent the usage data associated with each Internet user of interest. Subsequently, we show how the LSA vector space enables us to produce demographic inferences by supplying the input to a three layer neural model trained using the scaled conjugate gradient (SCG) method.

References

[1]
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R. Indexing by latent semantic analysis. Journal of the American Society For Information Science, 41(6), 1990.
[2]
Landauer, T. K., & Dumais, S. T., How come you know so much? From practical problem to theory. In D. Hermann, C. McEvoy, M. Johnson, & P. Hertel (Eds.), Basic and applied memory: Memory in context. Mahwah, NJ: Erlbaum, 105-126, 1996.
[3]
Landauer, T. K., & Dumais, S. T., A solution to Plato's problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104, 211-240, 1997.
[4]
M. W. Berry et al., SVDPACKC: Version 1.0 User's Guide, Tech. Rep. CS-93-194, University of Tennessee, Knoxville, TN, October 1993.
[5]
N. Belkin and W. Croft. Retrieval techniques. In M. Williams, editor, Annual Review of Information Science and Technology (ARIST), volume 22, chapter 4, pages 109-145. Elsevier Science Publishers B.V., 1987.
[6]
G. Golub and C. Van Loan. Matrix Computations. Johns-Hopkins, Baltimore, Maryland, second edition, 1989.
[7]
Salton, G. (ed), The SMART Retrieval System - Experiments in Automatic Document Processing, Englewood Cliffs, New Jersey: Prentice-Hall, 1971.
[8]
William H. Press, Saul A. Teukolsky, William T. Vetterling, and Brian P. Flannery. Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press, Cambridge, 2nd edition, 1992.
[9]
A. Zell et al., Stuttgart Neural Network Simulator: User Manual Version 4.1, University of Stuttgart, 1995.
[10]
Dumais, S. T. (1995), Using LSI for information filtering: TREC-3 experiments. In: D. Harman (Ed.), The Third Text REtrieval Conference (TREC3) National Institute of Standards and Technology Special Publication, in press 1995.
[11]
Dumais, S. T., Improving the retrieval of information from external sources. Behavior Research Methods, Instruments and Computers, 23(2), 229-236, 1991.
[12]
Dumais, S. T., Furnas, G. W., Landauer, T. K. and Deerwester, S., Using latent semantic analysis to improve information retrieval. In Proceedings of CHI'88: Conference on Human Factors in Computing, New York: ACM, 281-285, 1988.
[13]
Dumais, S. T., "Latent Semantic Indexing (LSI) and TREC-2." In: D. Harman (Ed.), The Second Text REtrieval Conference (TREC2), National Institute of Standards and Technology Special Publication 500-215, pp. 105-116, 1994.
[14]
Dumais, S. T., "LSI meets TREC: A status report." In: D. Harman (Ed.), The First Text REtrieval Conference (TREC1), National Institute of Standards and Technology Special Publication 500-207, pp. 137-152, 1993.
[15]
Kaski, S., ì Dimensionality reduction by random mapping: Fast similarity computation for clustering.î In Proceedings of IJCNN'98, International Joint Conference on Neural Networks, volume 1, pages 413-418. IEEE Service Center, Piscataway, NJ., 1998.

Cited By

View all
  • (2021)Predicting Voting Behavior Using Digital Trace DataSocial Science Computer Review10.1177/089443931988289639:5(862-883)Online publication date: 1-Oct-2021
  • (2017)Inferring Demographics and Social Networks of Mobile Device Users on Campus From AP-TrajectoriesProceedings of the 26th International Conference on World Wide Web Companion10.1145/3041021.3054140(139-147)Online publication date: 3-Apr-2017
  • (2016)Your Cart tells YouProceedings of the Ninth ACM International Conference on Web Search and Data Mining10.1145/2835776.2835783(173-182)Online publication date: 8-Feb-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
WEBKDD '99: Revised Papers from the International Workshop on Web Usage Analysis and User Profiling
August 1999
176 pages
ISBN:3540678182

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 15 August 1999

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Predicting Voting Behavior Using Digital Trace DataSocial Science Computer Review10.1177/089443931988289639:5(862-883)Online publication date: 1-Oct-2021
  • (2017)Inferring Demographics and Social Networks of Mobile Device Users on Campus From AP-TrajectoriesProceedings of the 26th International Conference on World Wide Web Companion10.1145/3041021.3054140(139-147)Online publication date: 3-Apr-2017
  • (2016)Your Cart tells YouProceedings of the Ninth ACM International Conference on Web Search and Data Mining10.1145/2835776.2835783(173-182)Online publication date: 8-Feb-2016
  • (2015)You Are Where You GoProceedings of the Eighth ACM International Conference on Web Search and Data Mining10.1145/2684822.2685287(295-304)Online publication date: 2-Feb-2015
  • (2014)Am i more similar to my followers or followees?Proceedings of the 25th ACM conference on Hypertext and social media10.1145/2631775.2631828(200-205)Online publication date: 1-Sep-2014
  • (2014)Scalable learning of users' preferences using networked dataProceedings of the 25th ACM conference on Hypertext and social media10.1145/2631775.2631796(4-12)Online publication date: 1-Sep-2014
  • (2010)Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream DataFundamenta Informaticae10.5555/1803672.180367798:1(49-70)Online publication date: 1-Jan-2010
  • (2003)Web page clustering using a self-organizing map of user navigation patternsDecision Support Systems10.1016/S0167-9236(02)00109-435:2(245-256)Online publication date: 1-May-2003
  • (2000)WEBKDD'99ACM SIGKDD Explorations Newsletter10.1145/846183.8462091:2(108-111)Online publication date: 1-Jan-2000

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media