Pazienza University of Roma Tor Vergata , Italy Armando Stellato University of Roma Tor Vergata , Italy Semi - Automatic Ontology Development : Processes and Resources
نویسنده
چکیده
The collection of the specialized vocabulary of a particular domain (terminology) is an important initial step of creating formalized domain knowledge representations (ontologies). Terminology Extraction (TE) aims at automating this process by collecting the relevant domain vocabulary from existing lexical resources or collections of domain texts. In this chapter, the authors address the extraction of multiword terminology, as multiword terms are very frequent in terminology but typically poorly represented in standard lexical resources. They present their method for mining multiword terminology from Wikipedia and the freely available terminology resource that they extracted using the presented method. Terminology extraction based on Wikipedia exploits the advantages of a huge multilingual, domain-transcending knowledge source and large scale structural information that can identify potential multiword units without the need for linguistic processing tools. Thus, while evaluated in English, the proposed method is basically applicable to all languages in Wikipedia. DOI: 10.4018/978-1-4666-0188-8.ch009
منابع مشابه
Efficient Integration of IP-Based Terrestrial and Satellite Systems: ARQ Techniques and Inter-Segment Handover
Satellite Systems: ARQ Techniques and Inter-segment Handover Ernestina Cianca∗,∗∗,Michele Angelaccio∗,Michele Luglio∗∗,Marina Ruggieri∗∗ Pasquale Daponte∗∗∗,Roberto Lojacono∗∗,Ramjee Prasad∗ (*) Center for PersonKommunikation, Aalborg University, Fr. Bajers Vej 7A5, DK-9220 Aalborg, Denmark (**) Dpt. of Electronics Engineering, University of Roma “Tor Vergata”, via Tor Vergata 110, 00133 Roma, ...
متن کاملPersonal data disclosure and data breaches: the customer's viewpoint
customer’s viewpoint Giuseppe D’Acquisto, Maurizio Naldi, and Giuseppe F. Italiano Garante per la Protezione dei Dati Personali, Piazza di Monte Citorio n. 121, Rome, Italy Università di Roma Tor Vergata, Dipartimento di Informatica Sistemi Produzione, Via del Politecnico 1, 00133 Roma, Italy Università di Roma Tor Vergata, Dipartimento di Informatica Sistemi Produzione, Via del Politecnico 1, ...
متن کاملIntestinal helminths of Italian barbel, Barbus tyberinus (Cypriniformes: Cyprinidae), from the Tiber River and first report of Acanthocephalus clavula (Acanthocephala) in the genus Barbus.
Cattedra di Parassitologia, Dipartimento di Sanità Pubblica e Biologia Cellulare, Università di Roma “Tor Vergata”, Via Montpellier 1, 00133 Roma, Italy; Laboratorio di Ecologia Sperimentale ed Acquacoltura, Dipartimento di Biologia, Università di Roma “Tor Vergata”, Via Cracovia, 00133 Roma, Italy; School of Biological Sciences, University of Exeter, Hatherly Laboratories, Prince of Wales Road...
متن کاملThe role of the uncinate fasciculus in the development of dementia: a DTI-tractography study
L. Serra, M. Cercignani, R. Perri, B. Spanò, L. Fadda, C. Marra, F. Giubilei, C. Caltagirone, and M. Bozzali Neuroimaging laboratory, Fondazione IRCCS Santa Lucia, Roma, Italy, Department of Clinical and Behavioural Neurology, Fondazione IRCCS Santa Lucia, Roma, Italy, Department of Neuroscience, University of Rome ‘Tor Vergata’, Rome, Italy, Institute of Neurology, Università Cattolica, Roma, ...
متن کاملSupervised Semantic Relation Mining from Linguistically Noisy Text Documents
In this paper, we present models for mining text relations between Named Entities, which can deal with data highly affected by linguistic noise. Our models are made robust by: (a) the exploitation of state-of-the-art statistical algorithms such as Support Vector Machines (SVMs) along with effective and versatile pattern mining methods, e.g. word sequence kernels; (b) the design of specific feat...
متن کامل