NaniBD: a set of tools for transcribing and validating speech databases
Document type: Conference proceedings text
Publisher: European Language Resources Association (ELRA)
Access conditions: Open access
This paper describes NaniBD, a set of tools for transcribing and validating speech databases, developed at the Signal Processing Group (GPS) of the Department of Signal Theory and Communications of the Polytechnic University of Catalonia (TSC/UPC). It was developed to meet the need for a revision system to validate and annotate the Spanish corpus of SpeechDat (II) in the speech processing environment available at GPS. Despite this origin, NaniBD is designed as a general-purpose system that could fit any other database, language, or speech processing system. So far, the system has been used to revise some 200,000 speech files from three different corpora. In this paper we focus on the actual implementation used in the transcription of a Catalan corpus compatible with the SpeechDat (II) specifications. The corpus comprises 1000 speakers, each contributing 44 files. In this application, we use speech-noise detection, automatic recognition of spontaneous prompts, digit- and letter-to-text translation, and access to an external database to minimise the time human operators spend on the revision procedure.
Citation: Nogueiras, A.; Moreno, M. NaniBD: a set of tools for transcribing and validating speech databases. In: International Conference on Language Resources and Evaluation. "LREC 1998: 1st International Conference on Language Resources and Evaluation: proceedings". Granada: European Language Resources Association (ELRA), 1998.