An Automatic audio segmentation system for radio newscast
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2099.1/4862
Tipus de documentProjecte/Treball Final de Carrera
Data2008-03
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 2.5 Espanya
Abstract
Current web search engines generally do not enable searches into audio files. Informative
metadata would allow searches into audio files, but producing such metadata is a tedious manual task. Tools for automatic production of metadata are therefore needed. This project describes the work done on the development of an automatic audio segmentation system which can be used for this metadata extraction. In this work the radio newscast are divided into segments in which there is only one speaker. Audio features used in this project include
Mel Frequency Cepstral Coefficients. This feature was extracted from audio files that were stored in a WAV format, using CLAM. Model-Selection-Based segmentation is used to
segment audio signals using this feature.
TitulacióENGINYERIA TÈCNICA DE TELECOMUNICACIÓ, ESPECIALITAT EN SO I IMATGE (Pla 2001)
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
VincenzoDimattia.pdf | MEMÒRIA | 5,015Mb | Visualitza/Obre |