CISUC

PAPEL: a dictionary-based lexical ontology for Portuguese

Authors

Abstract

This paper describes a project aimed at creating a lexical
ontology extracted (semi) automatically from a large Portuguese general
dictionary. Although using machine readable dictionaries to extract semantic
information is not new, we believe this is the first attempt for
the Portuguese language. The paper describes a (to be) freely available
resource, dubbed PAPEL, explaining the process used and the tools developed,
and illustrating it with one specific relation: Causation.

Keywords

lexical ontology dictionary automatic extraction semantic relations

Subject

Natural Language Processing

Related Project

PAPEL - Palavras Associadas Porto Editora Linguateca

Conference

PROPOR 2008 - International Conference on Computational Processing of Portuguese, September 2008


Cited by

Year 2016 : 1 citations

 Reis, S. and Baptista, J. (2016). Let's play wit proverbs? nlp tools and resources for icall applications around proverbs for pf. In Proceedings of the International Congress on Interdisciplinarity in Social and Human Sciences, pages 435–454. Research Centre for Spatial and Organizational Dynamics, University of Algarve.

Year 2015 : 5 citations

 Livy Real and Fabricio Chalub and Valeria de Paiva and Cláudia Freitas and Alexandre Rademaker (2015). Seeing is correcting: curating lexical resources using social interfaces. In Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, pages 20–29, Beijing, China. ACL Press.

 Simões, A. and Almeida, J. J. (2015). Experiments on enlarging a lexical ontology. In Languages, Applications and Technologies – Revised Selected Papers of 4th International Symposium SLATE 2015, Madrid, Spain, June, CCIS, pages 49–56. Springer.

 Markov, I., Mamede, N., and Baptista, J. (2015). A rule-based meronymy extraction module for Portuguese. Computación y Systemas, 19(4):661–683.

 Rita Policarpo (2015). Semantic classification of nouns. Master’s thesis, Instituto Superior Técnico.

 Marcos Garcia and Pablo Gamallo. Exploring the effectiveness of linguistic knowledge for biographical relation extraction. Natural Language Engineering, Volume 21, Issue 4, pp 519-551. 2015.

Year 2014 : 7 citations

 Ilia Markov, Nuno Mamede, Jorge Baptista. Body-Part Nouns and Whole-Part Relations in Portuguese. Proceedings of International Conference on the Computational Processing of the Portuguese Language (PROPOR 2014), LNCS, Volume 8775, pp 125-136. Springer, 2014.

 Cláudia Freitas, Valeria de Paiva, Alexandre Rademaker, Gerard de Melo, Livy Real, Anne Silva. Extending a Lexicon of Portuguese Nominalizations with Data from Corpora. Proceedings of International Conference on the Computational Processing of the Portuguese Language (PROPOR 2014), LNCS, Volume 8775, pp 114-124. Springer 2014.

 Brett Drury, Paula C.F. Cardoso, Janie M. Thomas, Alneu de Andrade Lopes. Lexical Resources for the Identification of Causative Relations in Portuguese Texts. Proceedings of the 1st Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish (ToRPorEsp), pp 56-63. São Carlos, SP, Brasil. 2014

 Ilia Markov, Nuno Mamede, Jorge Baptista. Whole-part relations rule-based automatic identification: Issues from fine-grained error analysis. In Proceedings of 13th Mexican International Conference on Artificial Intelligence, Part I, volume 8856 of LNCS, pages 37–50. Springer. 2014.

 Ilia Markov (2014). Automatic identification of whole-part relations in portuguese. Master’s thesis, Universidade do Algarve.

 Ilia Markov, Nuno Mamede, Jorge Baptista. Automatic Identification of Whole-Part Relations in Portuguese. Proceedings of 3rd Symposium on Languages, Applications and Technologies (SLATE 2014). OASIcs, 2014.

 Marcos Garcia (2014). Extracção de Relações Semânticas. Recursos, Ferramentas e Estratégias. PhD thesis, Universidade de Santiago de Compostela.

Year 2013 : 2 citations

 Sandra Collovini de Abreu, Tiago Luis Bonamigo, Renata Vieira. A review on Relation Extraction with an eye on Portuguese. Journal of the Brazilian Computer Society. Springer. July 2013

 Carolina Evaristo Scarton. VerbNet. Br: construção semiautomática de um léxico verbal online e independente de domínio para o português do Brasil. Tese de Mestrado. USP - São Carlos. Março 2013.

Year 2012 : 2 citations

 Jorge O. Santos, Ana O. Alves, Francisco C. Pereira, Pedro H. Abreu. Semantic Enrichment of Places for the Portuguese Language. Actas do Inforum 2012, Simpósio de Informática, Lisboa, Portugal.

 Leonardo Zilio, Kassius Prestes, Rodrigo Wilkens, Aline Villavicencio. Geração semi-automática de uma ontologia geral de língua comum. XI Encontro de Linguística de Corpus (ELC), São Carlos, SP, Brasil, 2012.

Year 2011 : 5 citations

 José Saias and Paulo Quaresma. Semantic networks and spreading activation process for QA improvement on text answers. In Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology - STIL2011, Mato Grosso, Brasil, 2011.

 Kassius Prestes and Rodrigo Wilkens and Leonardo Zillio and Aline Villavicencio. Extração e Validação de Ontologias a partir de Recursos Digitais. Proceedings of the 6th International Workshop on Metamodels, Ontologies and Semantic Technologies, pp. 183--188. Gramado, Brazil 2011

 Kassius Vargas Prestes. Avaliação de métodos de seleção da resposta em um sistema de perguntas e respostas. Projeto de Diplomação. Universidade Federal do Rio Grande do Sul, Brazil, 2011

 Cristiano Jose Lopes Marques. Syntactic REAP.PT. Master thesis. Universidade Técnica de Lisboa 2011

 J Saias, Contextualização e Activação Semântica na Selecção de Resultados em Sistemas de Pergunta-Resposta. Tese de doutoramento. Universidade de Évora 2011

Year 2010 : 6 citations

 Feten Baccar Ben Amar, Bilel Gargouri, Abdelmajid Ben Hamadou: Towards Generation of Domain Ontology from LMF Standardized Dictionaries. Proceedings of the Twenty-Second International Conference on Software Engineering & Knowledge Engineering (SEKE 2010), p. 515-520. 2010.

 Carolina Evaristo Scarton, Sandra Maria Aluisio. Análise da Inteligibilidade de textos via ferramentas de Processamento de Lingua Natural: adaptando as métricas do Coh Metrix para o Português. In Linguamatica v.2, n.1, 2010

 J. Teixeira, L. Sarmento, and E. Oliveira, ‘Comparing verb synonym resources for Portuguese;, in Computational Processing of the Portuguese Language, 9th International Conference, Proc. (PROPOR 2010), (2010).

 Alberto Simões, José João Almeida, and Rita Farinha. Processing and Extracting Data from Dicionário Aberto. In Proceedings of 7th International Conference on Language Resources and Evaluation (LREC), Malta, 2010

 Luís Sarmento. Definition and Computation of Similarity Operations between Web-specific Lexical Items. PhD Thesis. Universidade do Porto, 2010.

 Hernani Pereira Gomes Costa. Automatic Extraction and Validation of Lexical Ontologies from text. MSc Thesis. University of Coimbra, 2010.

Year 2008 : 2 citations

 Júlio Flávio Tavares Barbas, "Extracção Automática de Informação e Conhecimento em Textos no Âmbito B2B", Universidade Nova de Lisboa 2008

 Liliana Ferreira & António Teixeira, "Linguateca e Processamento de Linguagem Natural na Área da Saúde: Alguns Comentários e Sugestões". In Luís Costa, Diana Santos & Nuno Cardoso (eds.), Perspectivas sobre a Linguateca / Actas do encontro Linguateca : 10 anos. Linguateca, 2008, pp. 43-48.