SpeechDat-Car: Towards a Collection of Speech Databases for Automotive Environments
Abstract
The SpeechDat-Car project is a 4th Framework EC project in the Language Engineering programme. It aims to collect a set of nine speech databases to support training and testing of robust multilingual speech recognition for in-car applications. The consortium participants are car manufacturers, telephone communications providers, and universities. This paper describes the background of the project, its organisation, and the design of the databases in terms of contents, speaker coverage, and environment coverage. It further addresses the recording platforms, the validation scenario, and the links with other projects.
Related resources
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments
The aim of the SpeechDat-Car project is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, ... For each language, 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...
SpeechDat-Car: Speech Databases for Voice Driven Teleservices and Control of In-car Applications
The SpeechDat-Car project, part of the 4th Framework of the European Community's Language Engineering Programme, started in April 1998 with a duration of 30 months. It is a common initiative of car manufacturers, telephone communications operators, companies active in voice-operated services, and universities that aims at collecting a set of speech databases in nine different languages to suppo...
First experiences of the German SpeechDat-Car database collection in mobile environments
In SpeechDat-Car, speech databases for speech driven devices and services for mobile environments are collected for nine European languages. The German SpeechDat-Car installation was the first fully equipped platform within the project. It has served as a testbed for the recording software for the entire project, and as an opportunity to perform technical and organizational feasibility tests fo...
Towards Large Databases for Music Information Retrieval Systems Development and Evaluation
In the context of MIR/MDL evaluation, a key component for evaluation would be the availability to the research community of a large corpus of test data consisting of both audio and structured music data. This paper proposes a possible path towards this goal by following the basic principles of the SpeechDat projects. SpeechDat refers to successive EC supported projects of large scale multilingu...
SpeechDat-Car Fixed Platform
SpeechDat-Car aims to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. Two types of recordings compose the database. The first type consists of wideband audio signals recorded directly in the car, while the second type is composed of GSM signals transmitted from the car and recorded simultaneously in a far-en...