A Breadth-First Parsing Model
نویسنده
چکیده
Recent attempts at modeling humans' abilities at processing natural language have centered around depth first parsing algorithms, and control strategies for making the best choices for disambiguation and attachment. This paper proposes a breadth-first algorithm as a model. The algorithm avoids some of the common pitfalls of depth-first approaches regarding ambiguity, and by using more pre-ccmputed information about the grammar, avoids same of the usual problems of parallel parsing algorithms as well. In the study of computational models of human language processing, cognitive scientists seem to have given little attention to all-paths parsers, focusing instead on depth-first algorithms. This restriction is imposed so that the models will be consistent with the fact that people do not generally perceive ambiguities. In addition, it is an attempt to stay in line with the hypothesis that people parse sentences in linear time. The idea is that the fewer alternatives considered, the faster the parse time should be. The fastest way, of course, would be a depth first parse which made the right choice at every step of the way, hence the interest in deterministic parsers. The attempt to find principles which would guide a parser correctly through a depth-first search, led (Kimball, 1973) to formulate the principles of Right Association and Closure. propose the principles of Lexical Preference and Final Arguments. All of these principles try to account for how a top-down depth-first parser could get the preferred readings of sentences. However, the point I would like to make here is that for each choice point in an ambiguous example, there is a second alternative which the parser needs to be able to get at least some of the time. As (Crain and Steedman, 1981) point out, the fact that people generally only perceive one reading of a sentence is perfectly consistent with a parsing model which finds all the possible syntactic ramifications from looking at a word, does some contextual filtering to decide which alternative(s) to keep, and then looks at a new word and repeats. Given this, it is not obvious that the breadth-first approach is inferior. Since (Earley's, 1970) and (Pratt's, 1975) demonstrations of how a parser can work both bottom-up, and top-down, there have been several proposals for how this information might be used to good effect in a psychological model. (Chester, 1980) proposes a depth-first left-corner parser which uses top-down information. (Martin, Church, and Patil, 1981) propose an …
منابع مشابه
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملOntology-Based Secure XML Content Distribution
This paper presents an ontology-driven secure XML content distribution scheme. This scheme first relies on a semantic access control model for XML documents that achieves three objectives: (1) representing flexible and evolvable policies, (2) providing a high-level mapping and interoperable interface to documents, and (3) automating the granting of fine-grained access rights by inferring on con...
متن کاملJoint multilingual learning for coreference resolution
Natural language is a pervasive human skill not yet fully achievable by automated computing systems. The main challenge is understanding how to computationally model both the depth and the breadth of natural languages. In this thesis, I present two probabilistic models that systematically model both the depth and the breadth of natural languages for two different linguistic tasks: syntactic par...
متن کاملLinguistic processing using a dependency structure grammar for speech recognition and understanding
This paper proposes an efficient linguistic processing strategy for speech recognition and understanding using a dependency structure grammar. The strategy includes parsing and phrase prediction algorithms. After speech processing and phrase recognition based on phoneme recognition, the parser extracts the sentence with the best likelihood taking account of the phonetic likelihood of phrase can...
متن کاملبررسی مقایسهای تأثیر برچسبزنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی
In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...
متن کامل