Abstract
This paper describes the ongoing process of integrating approximately 3,000 predicate nouns into Port4NooJ, the Portuguese module for NooJ. Integrating these resources enables us to further extend the paraphrastic capabilities of the eSPERTo paraphrasing system, developed in the scope of a project with the same name. The integrated predicate nouns co-occur with the support verb fazer (do or make), and their syntactic and distributional properties are formalized in lexicon-grammar tables. From these lexicon-grammar tables we derived a standalone dictionary of predicate noun constructions and a few new grammars that can be used in paraphrase analysis and generation.
Notes
- 1.
Support verb constructions with the verb fazer are used in both written and spoken language and occur extensively in Portuguese sentences and noun phrases. Sentences with support verb constructions are more frequent than sentences with their equivalent verbs. In fact, Barreiro (2009) points out that, in a search over all sentences of the COMPARA parallel corpus (Frankenberg-Garcia and Santos 2003; Santos and Inácio 2006) in which the infinitive form of fazer occurs with a noun, or with a left modifier and a noun, 47% of the occurrences are support verb constructions.
- 2.
eSPERTo, which means ‘smart’ in Portuguese, stands for “System of Paraphrasing for Editing and Revision of Texts” (“Sistema de Parafraseamento para Edição e Revisão de Texto”).
- 3.
Each entry in the lexicon-grammar can map to more than one existing Port4NooJ entry.
- 4.
- 5.
- 6.
To look up the nominal entries in Port4NooJ correctly, we applied a simple rule that removes the first consonant of the sequences cç, ct and pt from words in Port4NooJ before comparing them to the predicate noun in the lexicon-grammar. However, if there are derivation rules associated with the entry, we still need to make sure that those rules do not introduce pre-agreement spelling.
- 7.
For now, the table that formalizes medical technical terms has been set aside for a second stage.
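The spelling rule described in note 6, together with the one-to-many mapping mentioned in note 3, can be illustrated with a minimal Python sketch. The function names `normalize_spelling` and `find_port4nooj_entries`, and the plain list of strings standing in for the Port4NooJ dictionary, are assumptions made for illustration; they are not part of NooJ or Port4NooJ. The sketch also assumes precomposed (NFC) Unicode input.

```python
import re

# Hypothetical comparison key: drop the first consonant of the
# sequences "cç", "ct" and "pt", as described in note 6.
_FIRST_CONSONANT = re.compile(r"c(?=[çt])|p(?=t)")


def normalize_spelling(word: str) -> str:
    """Return the word with c removed before ç/t and p removed before t."""
    return _FIRST_CONSONANT.sub("", word)


def find_port4nooj_entries(predicate_noun: str, port4nooj_words: list[str]) -> list[str]:
    """Return every dictionary word whose normalized form equals the noun.

    A list is returned because one lexicon-grammar entry can map to
    more than one Port4NooJ entry (note 3).
    """
    return [w for w in port4nooj_words if normalize_spelling(w) == predicate_noun]


if __name__ == "__main__":
    # Illustrative word list, not actual Port4NooJ content.
    words = ["acção", "ação", "acto", "director"]
    print(find_port4nooj_entries("ação", words))
```

Because the rule is applied blindly, it also rewrites words in which the consonant is still written, so the normalized form is best treated as a comparison key only and not as a corrected spelling.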
References
Baptista, J.: Sintaxe dos predicados nominais com ‘ser de’. Fundação Calouste Gulbenkian, Fundação para a Ciência e a Tecnologia, Lisboa (2005)
Barreiro, A.: Make it simple with paraphrases: automated paraphrasing for authoring aids and machine translation, Porto, Portugal (2009)
Carvalho, P.: Análise e Representação de Construções Adjectivais para Processamento Automático de Texto. Adjectivos Intransitivos Humanos. Ph.D. thesis, Universidade de Lisboa, Lisboa, Portugal (2007)
Casteleiro, J.M.: Sintaxe transformacional do adjetivo. INIC, Lisboa (1981)
Chacoto, L.: O Verbo Fazer em Construções Nominais Predicativas. Ph.D. thesis, Universidade do Algarve, Faro, Portugal (2005)
D’Agostino, E., Elia, A.: Il significato delle frasi: un continuum dalle frasi semplici alle forme polirematiche. In: AA. VV, Ai limiti del linguaggio, pp. 287–310. Laterza, Bari (1998)
Frankenberg-Garcia, A., Santos, D.: Introducing COMPARA: the Portuguese-English parallel corpus. In: Zanettin, F., Bernardini, S., Stewart, D. (eds.) Corpora in Translator Education, pp. 71–87. St. Jerome, Manchester (2003)
Gross, M.: Méthodes en syntaxe: régime des constructions complétives. Actualités scientifiques et industrielles. Hermann, Paris (1975)
Laporte, E., Voyatzi, S.: An electronic dictionary of French multiword adverbs. In: Language Resources and Evaluation Conference, Workshop Towards a Shared Task for Multiword Expressions, pp. 31–34 (2008)
Mota, C., Carvalho, P., Raposo, F., Barreiro, A.: Generating paraphrases of human intransitive adjective constructions with Port4NooJ. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds.) NooJ 2015. CCIS, vol. 607, pp. 107–122. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42471-2_10
Ranchhod, E.: Sintaxe dos predicados nominais com estar. Linguística, INIC, Lisboa (1990)
Salkoff, M.: Automatic translation of support verb constructions. In: Proceedings of 13th Conference on Computational Linguistics, COLING 1990, vol. 3, pp. 243–246. ACL, Helsinki (1990)
Santos, D., Inácio, S.: Annotating COMPARA, a grammar-aware parallel corpus. In: Calzolari, N., et al. (eds.) Proceedings of 5th LREC, Genoa, Italy, pp. 1216–1221 (2006)
Silberztein, M.: Les groupes nominaux productifs et les noms composés lexicalisés. Lingvisticæ Investigationes 17(2), 405–425 (1993)
Silberztein, M.: Formalizing Natural Languages: The NooJ Approach, p. 346. Wiley, Hoboken (2016)
Acknowledgements
This research was supported by Fundação para a Ciência e Tecnologia (FCT), under exploratory project eSPERTo (Ref. EXPL/MHC-LIN/2260/2013). Anabela Barreiro was also funded by FCT through post-doctoral grant SFRH/BPD/91446/2012. The authors would like to thank Max Silberztein for his prompt support and guidance with all matters related to NooJ.
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Mota, C., Chacoto, L., Barreiro, A. (2018). Integrating the Lexicon-Grammar of Predicate Nouns with Support Verb fazer into Port4NooJ. In: Mbarki, S., Mourchid, M., Silberztein, M. (eds) Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications. NooJ 2017. Communications in Computer and Information Science, vol 811. Springer, Cham. https://doi.org/10.1007/978-3-319-73420-0_3
DOI: https://doi.org/10.1007/978-3-319-73420-0_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73419-4
Online ISBN: 978-3-319-73420-0
eBook Packages: Computer Science (R0)