Firms' knowledge profiles: Mapping patent data with unsupervised learning
نویسندگان
چکیده
منابع مشابه
Unsupervised Learning with Permuted Data
We consider the problem of unsupervised learning from a matrix of data vectors where in each row the observed values are randomly permuted in an unknown fashion. Such problems arise naturally in areas such as computer vision and text modeling where measurements need not be in correspondence with the correct features. We provide a general theoretical characterization of the difficulty of “unscra...
متن کاملMultinational Firms and International Knowledge Diffusion: Evidence using Patent Citation Data
This paper addresses three questions: (i) Are multinational firms (MNCs) really better than markets at transferring knowledge across borders? (ii) How actively do MNCs exchange knowledge with their host countries? (iii) Do they contribute as much to local knowledge as they learn from their host countries? To answer these questions, I analyze data on citations for over half a million patents fro...
متن کاملMapping Semantic Knowledge for Unsupervised Text Categorisation
Text categorisation is challenging, due to the complex structure with heterogeneous, changing topics in documents. The performance of text categorisation relies on the quality of samples, effectiveness of document features, and the topic coverage of categories, depending on the employing strategies; supervised or unsupervised; single labelled or multi-labelled. Attempting to deal with these rel...
متن کاملUnsupervised Learning with Non-Ignorable Missing Data
In this paper we explore the topic of unsupervised learning in the presence of nonignorable missing data with an unknown missing data mechanism. We discuss several classes of missing data mechanisms for categorical data and develop learning and inference methods for two specific models. We present empirical results using synthetic data which show that these algorithms can recover both the unkno...
متن کاملBayesian unsupervised learning with multiple data types.
We propose Bayesian generative models for unsupervised learning with two types of data and an assumed dependency of one type of data on the other. We consider two algorithmic approaches, based on a correspondence model, where latent variables are shared across datasets. These models indicate the appropriate number of clusters in addition to indicating relevant features in both types of data. We...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Technological Forecasting and Social Change
سال: 2017
ISSN: 0040-1625
DOI: 10.1016/j.techfore.2016.09.028