Show simple item record

dc.contributorEsquerra Llucià, Ignasi
dc.contributor.authorDimattia, Vincenzo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.description.abstractCurrent web search engines generally do not enable searches into audio files. Informative metadata would allow searches into audio files, but producing such metadata is a tedious manual task. Tools for automatic production of metadata are therefore needed. This project describes the work done on the development of an automatic audio segmentation system which can be used for this metadata extraction. In this work the radio newscast are divided into segments in which there is only one speaker. Audio features used in this project include Mel Frequency Cepstral Coefficients. This feature was extracted from audio files that were stored in a WAV format, using CLAM. Model-Selection-Based segmentation is used to segment audio signals using this feature.
dc.publisherUniversitat Politècnica de Catalunya
dc.rightsAttribution-NonCommercial-NoDerivs 2.5 Spain
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subjectÀrees temàtiques de la UPC::So, imatge i multimèdia
dc.subject.lcshDigital audio
dc.subject.otherAudio files metadata
dc.titleAn Automatic audio segmentation system for radio newscast
dc.typeMaster thesis (pre-Bologna period)
dc.subject.lemacSo -- Processament de dades
dc.subject.lemacÀudio -- Software d'ordinadors
dc.rights.accessOpen Access
dc.audience.educationlevelEstudis de primer/segon cicle
dc.audience.mediatorEscola Universitària d'Enginyeria Tècnica Industrial de Terrassa

Files in this item


This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 2.5 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-NoDerivs 2.5 Spain