2 past Research: Information Retrieval Models
نویسنده
چکیده
My research interests lie in Machine Learning (ML) and its applications to Information Retrieval (IR), Text analysis and Data Mining (DM). These applied areas have historically been driven by empirical approaches. While these approaches have been quite successful in terms of performance, one of their major drawbacks is their lack of easy interpretability. It is my firm belief that machine learning approaches are a key towards solving some of the complex problems in text analysis and IR for the following reasons: (i) ML techniques are theoretically well grounded, making them easily interpretable, (ii) the theoretical framework of ML techniques acts as a guidance for future researchers to build extensions to existing models and (iii) the theoretical foundation also makes the models generalizable and portable across different tasks and domains. In line with this philosophy, my research has thus far been geared towards advancing the state-of-the-art technology in the above mentioned areas using theoretically well motivated, machine learning based approaches. While staying within the confines of this broad objective, I consciously made an effort not to limit myself to any particular school of thought and retained the flexibility to build models in diverse frameworks such as discriminative and generative, supervised and unsupervised approaches. I started out my research career as a graduate student in IR. I describe several theoretical contributions I made to this area in section 2. Not content with the knowledge I gained in my graduate study, I accepted a post doctoral fellowship in the Machine Learning department of Carnegie Mellon University (CMU), with a strong desire to sharpen my skills in Machine Learning. In section 3, I discuss my current research at CMU on topic modeling for data mining from large document collections. Section 4 describes my future research directions and section 5 concludes the statement with a discussion on my philosophy on other matters pertaining to a successful research career.
منابع مشابه
کاربست مدل بازیابی تخصص برای یافتن نویسندگان خبره
This research applied Expertise Retrieval model for finding expert authors, and used evaluation methods of Information Retrieval systems for measuring the performance of those models. Current research is an experimental one. Besides, a variety of methods including survey method has been used in the research process. Various models were developed for finding expert authors, all built on a known ...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملبررسی تأثیرات ریشهیابی در بازیابی اطلاعات در زبان فارسی
Using the language-specific behavior in information retrieval systems can improve the quality of the retrieved results significantly. Part of the word that remains after removing its affixes is called stem. Stemming process can be used for improving the relevancy of the results in information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...
متن کاملA Graph-Based Information Retrieval Model
The goal of information retrieval is to effectively retrieve documents relevant to users’ queries. A variety of models and techniques have been proposed over the past 50 years. In this research, we introduce a novel, graph-based approach to information retrieval. Its computation is fast and scalable, and its structure is flexible to incorporate many performance enhancement techniques. The perfo...
متن کاملQEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...
متن کامل