Lorify: A Knowledge Base from Scratch
نویسندگان
چکیده
In this paper we discuss our approach to the task of Cold-Start Knowledge Base Population and the challenges associated with it. We describe our knowledge base system Lorify and each of the components necessary to populate it from unstructured text. The pivotal component for building a large-scale knowledge base is scalable cross-document coreference. We address this with a novel clustering algorithm based on Markov-ChainMonteCarlo, and show that it is capable of scaling to much larger sets of entities than typical algorithms. Finally, we detail the performance of this system on the TAC KBP 2012 evaluation.
منابع مشابه
DEXSY2 A Dental Expert System for Diagnosis and Treatment
DEXSY2 is a dental expert system, which diagnoses oral diseases and offers a treatment course. The system which is designed and implemented from scratch is capable of diagnosing among thirty five oral diseases and offering a course of treatment for each. It uses a decision tree for its representation of knowledge, and each of its nodes contains a frame. The knowledge base of the system contains...
متن کاملDEXSY2 A Dental Expert System for Diagnosis and Treatment
DEXSY2 is a dental expert system, which diagnoses oral diseases and offers a treatment course. The system which is designed and implemented from scratch is capable of diagnosing among thirty five oral diseases and offering a course of treatment for each. It uses a decision tree for its representation of knowledge, and each of its nodes contains a frame. The knowledge base of the system contains...
متن کاملThe DARPA Knowledge Sharing E ort: Progress Report
Building knowledge-based systems today usually entails constructing a new knowledge base from scratch. Even if several groups of researchers are working in the same general area, such as medicine or electronic diagnosis, each team must develop its own knowledge base from scratch. The cost of this duplication of effort has been high and will become prohibitive as we build larger and larger syste...
متن کاملIntegrating Order and Distance Relationships from Heterogeneous Maps
There is no automatic mechanism to integrate information between heterogeneous genome maps. Currently, integration is a difficult, manual process. We have developed a process for knowledge base design, and we use this to integrate order and distance relationships between genetic linkage, radiation hybrid, and physical maps. Until now, the only way to develop a persistent, knowledge-intensive ap...
متن کاملTowards Meta-Engineering for Semantic Wikis
Building intelligent systems is a complex task. In many knowledge engineering projects the knowledge acquisition activities can significantly benefit from a tool, that is tailored to the specific project setting with respect to domain, contributors, and goals. Specifying and building a new tool from scratch is ambitious, tedious, and delaying. In this paper we introduce a wiki-based meta-engine...
متن کامل