Using dependency parsing and machine learning for factoid question answering on spoken documents
Tipus de documentComunicació de congrés
Condicions d'accésAccés obert
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two approaches to answer extraction in question answering for speech corpora that apply machine learning to improve the coverage and precision of the extraction. The first one is a reranker that uses only lexical information, the second one uses dependency parsing to score robust similarity between syntactic structures. Our experimental results show that the proposed learning models improve our previous results using only hand-made ranking rules with small syntactic information. Moreover, this results show also that a dependency parser can be useful for speech transcripts even if it was trained with written text data from a news collection. We evaluate the system on manual transcripts of speech from EPPS English corpus and a set of questions transcribed from spontaneous oral questions. This data belongs to the CLEF 2009 track on QA on speech transcripts (QAst).
CitacióComas, P.; Turmo, J.; Màrquez, L. Using dependency parsing and machine learning for factoid question answering on spoken documents. A: Annual Conference of the International Speech Communication Association. "Proceedings of Interspeech 2010 : spoken language processing for all". Makuhari: 2010, p. 1-4.
Versió de l'editorhttp://nlp.lsi.upc.edu/papers/comas10b.pdf