ONZE Miner: Development of a browser-based research tool
نویسندگان
چکیده
The Origins of New Zealand English project (ONZE) at the University of Canterbury houses a large audio corpus. Until a few years ago, this corpus was stored as a series of audio tapes and Microsoft Word documents. However the corpus is now housed on a central server, and can be interacted with via the tailor-made software ‘ONZE Miner’. ONZE Miner is a digitally interactive database which enables researchers to search across and interact with sound files. It houses time-aligned transcripts of the sound-files, which are tagged for phonological, grammatical and morphological information, all of which is searchable. The researcher can conduct acoustic analysis of sound files directly via the ONZE Miner interface. Search results can be exported into excel, together with hypertext links to the relevant sound files. This paper describes the development and the architecture of the ONZE Miner software. Manuscript currently under review, December 2006. ONZE Miner: Development of a browser-based research tool. Abstract The Origins of New Zealand English project (ONZE) at the University of Canterbury houses a large audio corpus. Until a few years ago, this corpus was stored as a series of audio tapes and Microsoft Word documents. However the corpus is now housed on a central server, and can be interacted with via the tailor-made software ‘ONZE Miner’. ONZE Miner is a digitally interactive database which enables researchers to search across and interact with sound files. It houses time-aligned transcripts of the sound-files, which are tagged for phonological, grammatical and morphological information, all of which is searchable. The researcher can conduct acoustic analysis of sound files directly via the ONZE Miner interface. Search results can be exported into excel, together with hypertext links to the relevant sound files. This paper describes the development and the architecture of the ONZE Miner software.The Origins of New Zealand English project (ONZE) at the University of Canterbury houses a large audio corpus. Until a few years ago, this corpus was stored as a series of audio tapes and Microsoft Word documents. However the corpus is now housed on a central server, and can be interacted with via the tailor-made software ‘ONZE Miner’. ONZE Miner is a digitally interactive database which enables researchers to search across and interact with sound files. It houses time-aligned transcripts of the sound-files, which are tagged for phonological, grammatical and morphological information, all of which is searchable. The researcher can conduct acoustic analysis of sound files directly via the ONZE Miner interface. Search results can be exported into excel, together with hypertext links to the relevant sound files. This paper describes the development and the architecture of the ONZE Miner software.
منابع مشابه
LaBB-CAT: an Annotation Store
“ONZE Miner”, an open-source tool for storing and automatically annotating Transcriber transcripts, has been redeveloped to use “annotation graphs” as its data model. The annotation graph framework provides the new software, “LaBB-CAT”, greater flexibility for automatic and manual annotation of corpus data at various independent levels of granularity, and allows more sophisticated annotation st...
متن کاملEvolvingWeb-Based Test Automation into Agile Business Specifications
Usually, test automation scripts for a web application directly mirror the actions that the tester carries out in the browser, but they tend to be verbose and repetitive, making them expensive to maintain and ineffective in an agile setting. Our research has focussed on providing tool-support for business-level, example-based specifications that are mapped to the browser level for automatic ver...
متن کاملNavigating Multimodal Meeting Recordings with the Meeting Miner
We present Meeting Miner, a multimodal meeting browser for navigating recordings of online text and speech collaborative meetings. Meetings are recorded through a collaborative writing environment specially designed to capture participants activities. This information, usually lost in common recordings of multimodal meetings, offers novel possibilities for indexing, navigation and information r...
متن کاملFUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING
The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...
متن کاملThe eShopmonitor: A comprehensive data extraction tool for monitoring Web sites
Typical commercial Web sites publish information from multiple back-end data sources; these data sources are also updated very frequently. Given the size of most commercial sites today, it becomes essential to have an automated means of checking for correctness and consistency of data. The eShopmonitor allows users to specify items of interest to be tracked, monitors these items on the Web page...
متن کامل