Abstract
All sort of organizations needs as many information about their target population. Public datasets provides one important source of this information. However, the use of these databases is very difficult due to the lack of cross-references.
In Spain, two main public databases are available: Population and Housing Censuses and Family Expenditure Surveys. Both of them are published by Spanish Statistical Institute. These two databases can not be joined due to the different aggregation level (FES contains information about families while PHC contains the same information but aggregated). Besides, national laws protects this information and makes difficult the use of the datasets.
work defines a new methodology for join the two datasets based on Genetic Algorithms. The approach proposed could be used in any case where data with different aggregation level need to be joined.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Holland, J.H.: Adaption in natural and artificial systems. The University of Michigan Press, Ann Harbor (1975)
INE, http://www.ine.es/daco/daco43/metodo_ecpf_trimestral.doc
INE (2005), http://www.ine.es
INE (2005), http://www.ine.es/censo2001/censo2001.htm
INE (2005), http://www.ine.es/daco/daco43/notecpf8597.htm
Larrañaga, P., Lozano, J.A.: Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation. Kluwer Academic Publishers, Dordrecht (2001)
MOSAIC, http://www.business-strategies.co.uk/Content.asp? ArticleID=629 (1999)
Papadimitriou, C.H., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Prentice-Hall, Englewood Cliffs (1982)
Kok, J.N., Van der Putten, P., Gupta, A.: Data fusion through statistical matching. Center for eBussiness@MIT (2002)
R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2004) ISBN 3-900051-07-0.
Montes, C., Frutos, S., Menasalvas, E., Segovia, J.: Calculating economic indexes per household and censal section from official spanish databases. In: ECML/PKDD (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cubo, Ó., Robles, V., Segovia, J., Menasalvas, E. (2005). Using Genetic Algorithms to Improve Accuracy of Economical Indexes Prediction. In: Famili, A.F., Kok, J.N., Peña, J.M., Siebes, A., Feelders, A. (eds) Advances in Intelligent Data Analysis VI. IDA 2005. Lecture Notes in Computer Science, vol 3646. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552253_6
Download citation
DOI: https://doi.org/10.1007/11552253_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28795-7
Online ISBN: 978-3-540-31926-9
eBook Packages: Computer ScienceComputer Science (R0)