OpenWordNet-PT: A Project Report
نویسندگان
چکیده
This paper presents OpenWordNet-PT, a freely available open-source wordnet for Portuguese, with its latest developments and practical uses. We provide a detailed description of the RDF representation developed for OpenWordnet-PT. We highlight our efforts to extend the coverage of our resource and add nominalization relations connecting nouns and verbs. Finally, we present several real-world applications where OpenWordnet-PT was put to use, including a large-scale high-throughput sentiment analysis system.
منابع مشابه
Embedding NomLex-BR nominalizations into OpenWordnet-PT
This paper presents NomLex-BR, a lexical resource describing Brazilian Portuguese nominalizations, and its integration with OpenWordnet-PT. We first describe the original English NOMLEX lexical resource and how we used it to bootstrap a Portuguese version. Subsequently, we describe how this lexicon can be embedded into OpenWordnet-PT, which facilitates its use and helps spot-checking both the b...
متن کاملAnotação de corpus com a OpenWordNet-PT: um exercício de desambiguação (Sense annotation with OpenWordNet-PT: an exercise of word sense disambiguation)
This paper presents the first effort towards a portuguese wordnet annotated corpus. We mannualy annotated 30 sentences, using the OpenWordNetPT as a lexicon, and then compared the results with an automatic annotation. In addition to the system’s evaluation, the results provided valuable insights about how to deal with this ambitious task. Resumo. O presente trabalho apresenta o primeiro passo e...
متن کاملSeeing is Correcting: curating lexical resources using social interfaces
This note describes OpenWordnet-PT, an automatically created, manually curated wordnet for Portuguese and introduces the newly developed web interface we are using to speed up its manual curation. OpenWordNet-PT is part of a collection of wordnets for various languages, jointly described and distributed through the Open MultiLingual WordNet and the Global WordNet Association. OpenWordnet-PT has...
متن کاملNomLex-PT: A Lexicon of Portuguese Nominalizations
This paper presents NomLex-PT, a lexical resource describing Portuguese nominalizations. NomLex-PT connects verbs to their nominalizations, thereby enabling NLP systems to observe the potential semantic relationships between the two words when analysing a text. NomLex-PT is freely available and encoded in RDF for easy integration with other resources. Most notably, we have integrated NomLex-PT ...
متن کاملExtending NomLex-PT using AnCora-Nom
This work describes how we used AnCora-Nom, a Spanish nominalization lexicon, to extend NomLex-PT, a lexical resource for Portuguese, originally based on the English NomLex lexicon and fully integrated to OpenWordNet-PT, our freely available Portuguese WordNet. The complete Spanish lexicon, which contains 1,655 entries, was translated to Portuguese and then compared to our previous data. Furthe...
متن کامل