Abstract
A methodology for the sub-symbolic semantic encoding of words is presented. It combines WordNet, a standard and semantically highly structured lexical database, with the semidiscrete matrix decomposition (SDD) to obtain vector representations of words in a semantic n-space with low memory requirements. Applying the proposed algorithm to all the words in WordNet would yield a useful tool for the sub-symbolic processing of texts.
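As a rough illustration of why a semidiscrete decomposition keeps memory requirements low, the sketch below approximates a matrix A by a sum of terms d_k x_k y_k^T whose vectors x_k and y_k contain only the values -1, 0 and 1. It uses a simple greedy alternating scheme and is not necessarily the exact procedure used in the chapter. The function names `best_ternary` and `sdd`, the starting heuristic, and the toy matrix are assumptions made for this example; the abstract does not say how the word matrix is derived from WordNet.

```python
import numpy as np


def best_ternary(s):
    """Return x in {-1, 0, 1}^n maximizing (x . s)^2 / (x . x)."""
    mags = np.abs(s)
    order = np.argsort(-mags)                  # indices by decreasing |s_i|
    cums = np.cumsum(mags[order])
    scores = cums ** 2 / np.arange(1, len(s) + 1)
    j = int(np.argmax(scores)) + 1             # number of nonzero entries kept
    x = np.zeros(len(s))
    x[order[:j]] = np.sign(s[order[:j]])
    return x


def sdd(a, rank, inner_iters=20):
    """Greedy semidiscrete approximation a ~ X @ diag(d) @ Y.T.

    X and Y hold only -1, 0 and 1; d holds one real weight per term.
    """
    a = np.asarray(a, dtype=float)
    m, n = a.shape
    r = a.copy()                               # residual still to be explained
    xs, ds, ys = [], [], []
    for _ in range(rank):
        if not np.any(r):                      # residual is exactly zero
            break
        # Assumed starting guess: unit vector on the heaviest residual column.
        y = np.zeros(n)
        y[np.argmax(np.sum(r ** 2, axis=0))] = 1.0
        for _ in range(inner_iters):
            x = best_ternary(r @ y)            # best x for the current y
            y_new = best_ternary(r.T @ x)      # best y for that x
            if np.array_equal(y_new, y):
                break
            y = y_new
        d = float(x @ r @ y) / ((x @ x) * (y @ y))   # optimal weight for (x, y)
        r -= d * np.outer(x, y)
        xs.append(x)
        ds.append(d)
        ys.append(y)
    k = len(ds)
    return (np.array(xs).reshape(k, m).T,
            np.array(ds),
            np.array(ys).reshape(k, n).T)


# Toy matrix (illustrative only): rows stand for words, columns for features.
toy = 2.0 * np.outer([1, -1, 0], [1, 0, 1, -1])
X, d, Y = sdd(toy, rank=1)
print(np.allclose(X @ np.diag(d) @ Y.T, toy))
```

In a sketch like this, each row of `X` would serve as the compact code of one word: every entry needs only two bits, and a single floating-point weight is stored per term.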
Copyright information
© 2005 Springer
About this paper
Cite this paper
Pilato, G., Vassallo, G., Gaglio, S. (2005). Wordnet and Semidiscrete Decomposition for Sub-Symbolic Representation of Words. In: Apolloni, B., Marinaro, M., Tagliaferri, R. (eds) Biological and Artificial Intelligence Environments. Springer, Dordrecht. https://doi.org/10.1007/1-4020-3432-6_23
DOI: https://doi.org/10.1007/1-4020-3432-6_23
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-3431-2
Online ISBN: 978-1-4020-3432-9
eBook Packages: Computer Science, Computer Science (R0)