Refinery: An Open Source Topic Modeling Web Platform

نویسندگان

  • Dae Il Kim
  • Benjamin F. Swanson
  • Michael C. Hughes
  • Erik B. Sudderth
چکیده

We introduce Refinery, an open source platform for exploring large text document collections with topic models. Refinery is a standalone web application driven by a graphical interface, so it is usable by those without machine learning or programming expertise. Users can interactively organize articles by topic and also refine this organization with phrase-level analysis. Under the hood, we train Bayesian nonparametric topic models that can adapt model complexity to the provided data with scalable learning algorithms. The project website http://daeilkim.github.io/refinery/ contains Python code and further documentation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Testing Methodology for an Open Software E-learning Platform

This paper presents an outline of a methodology for the test of an open software e-learning platform. The methodology includes a test generation method from UML descriptions, in particular Sequence, Activity, Class diagrams and Navigation maps. It also includes data modelling proposing two approaches: one based on OCL and the other on UML profiles. The studied open software e-learning platform ...

متن کامل

Experimentation of a socially constructed “Topic Map” by the OSS community

We present, in this article, a “topic map” system applied to the Open Source Software (OSS) community. Our approach is deliberately open and based on the HyperTopic model created by TechCICO lab. Our collective experimentation aims at the construction of a shared information platform that would be visible and useable for the OSS community. Thanks to this platform, OSS community members can desc...

متن کامل

Evolvable Semantic Platform for Facilitating Knowledge Exchange

The authors propose new formal foundations and design approach to develop an evolving semantic platform for finding experts relevant to events arising in the open environment of modern economical clusters. This work offers a new implementation of probabilistic latent topic modeling method with two linked indicators (categories and experts) to mach expertise. In order to show feasibility of the ...

متن کامل

Web Services for an Open Generalisation Research Platform

While automated access and presentation of cartographic data over the internet are defined, services for automated generalisation are not yet standardised. This paper aims to show advantages of applying the service concept to generalisation for the development of a common research platform, where researchers would have access to a common generalisation framework. There follows a detailed explan...

متن کامل

AToMPM: A Web-based Modeling Environment

We introduce AToMPM, an open-source framework for designing domain-specific modeling environments, performing model transformations, manipulating and managing models. It runs completely over the web, making it independent from any operating system, platform, or device it may execute on. AToMPM offers an online collaborative experience for modeling. Its unique architecture makes the framework fl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Machine Learning Research

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2017