Real-Time Keyword Extraction from Conversations
نویسندگان
چکیده
We introduce a novel, fully unsupervised method to extract keywords from meeting speech in real-time. Our approach represents text as a word co-occurrence network and leverages the k-core graph decomposition algorithm and properties of submodular functions. We outperform multiple baselines in a real-time scenario emulated from the AMI and ICSI meeting corpora. Evaluation is conducted against both extractive and abstractive gold standard using two standard performance metrics and a newer one based on word embeddings.
منابع مشابه
Diverse Keyword Extraction from Conversations
A new method for keyword extraction from conversations is introduced, which preserves the diversity of topics that are mentioned. Inspired from summarization, the method maximizes the coverage of topics that are recognized automatically in transcripts of conversation fragments. The method is evaluated on excerpts of the Fisher and AMI corpora, using a crowdsourcing platform to elicit comparativ...
متن کاملDocument Recommendation for Conversation Based on Keyword Extraction and Clustering
Through this project we are extracting appropriate keyword from conversation input. Extracted keywords are matched with available documents. Finally, we recommend appropriate documents to the participants for reference. It also represents the problem faced during keyword extraction in conversation using automatic speech recognition (ASR) system which brings errors in result. In order to overcom...
متن کاملSearch Engine Optimization for Threaded-Conversations
Online discussion communities are becoming increasingly popular among web users, where an extensive amount of discussion and commenting takes place. However, it is difficult to search these conversations as search engines are not optimized for the conversation-structure of online communities. In this paper, we purpose a method of ranking search results based on the conversation-structure and us...
متن کاملA System For Searching And Browsing Spoken Communications
As the amount of spoken communications accessible by computers increases, searching and browsing is becoming crucial for utilizing such material for gathering information. It is desirable for multimedia content analysis systems to handle various formats of data and to serve varying user needs while presenting a simple and consistent user interface. In this paper, we present a research system fo...
متن کاملPolypus: a Big Data Self-Deployable Architecture for Microblogging Text Extraction and Real-Time Sentiment Analysis
In this paper we propose a new parallel architecture based on Big Data technologies for real-time sentiment analysis on microblogging posts. Polypus is a modular framework that provides the following functionalities: (1) massive text extraction from Twitter, (2) distributed non-relational storage optimized for time range queries, (3) memory-based intermodule buffering, (4) real-time sentiment c...
متن کامل