The TALP-UPC approach to system selection: ASIYA features and pairwise classification using random forests
Document type: Conference report
Defense date: 2013
Rights access: Restricted access - publisher's policy
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
This paper describes the TALP-UPC participation in the WMT’13 Shared Task on Quality Estimation (QE). Our participation is limited to Task 1.2 on System Selection. We used a broad set of features (86 for German-to-English and 97 for English-to-Spanish) ranging from standard QE features to features based on pseudo-references and semantic similarity. We approached system selection by means of pairwise ranking decisions. For that, we learned Random Forest classifiers especially tailored for the problem. Evaluation at development time showed good results in a cross-validation experiment, with Kendall’s τ values around 0.30. The results on the test set dropped significantly, raising several points for discussion.
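
The abstract's two central ideas, ranking candidate systems from pairwise decisions and scoring a ranking with Kendall's τ, can be illustrated with a minimal sketch. This is not the paper's implementation: the `prefers` function stands in for a trained pairwise classifier (Random Forests in the paper), the `scores` values and system names are invented for illustration, and the τ shown is the plain (concordant - discordant) / (concordant + discordant) form, which may differ from the variant used in the shared task.

```python
from itertools import combinations

def rank_from_pairwise(systems, prefers):
    """Rank systems by their number of pairwise wins (rank 1 = best).

    `prefers(a, b)` is a hypothetical stand-in for a pairwise classifier:
    it returns True when translation a is judged better than b.
    Ties in win counts are broken by the input order of `systems`.
    """
    wins = {s: 0 for s in systems}
    for a, b in combinations(systems, 2):
        winner = a if prefers(a, b) else b
        wins[winner] += 1
    ordered = sorted(systems, key=lambda s: -wins[s])
    return {s: i + 1 for i, s in enumerate(ordered)}

def kendall_tau(predicted, reference):
    """Kendall's tau over system pairs, skipping pairs tied in either ranking."""
    concordant = discordant = 0
    for a, b in combinations(list(reference), 2):
        ref_diff = reference[a] - reference[b]
        pred_diff = predicted[a] - predicted[b]
        if ref_diff == 0 or pred_diff == 0:
            continue  # tied pairs carry no ordering information here
        if ref_diff * pred_diff > 0:
            concordant += 1
        else:
            discordant += 1
    total = concordant + discordant
    return (concordant - discordant) / total if total else 0.0

# Illustrative data only: made-up quality scores replace a real classifier.
scores = {"A": 0.9, "B": 0.4, "C": 0.7, "D": 0.1}
predicted = rank_from_pairwise(list(scores), lambda a, b: scores[a] > scores[b])
reference = {"A": 1, "B": 2, "C": 3, "D": 4}  # hypothetical human ranking
tau = kendall_tau(predicted, reference)
print(predicted, round(tau, 3))
```

In this toy case the predicted ranking swaps B and C relative to the reference, giving five concordant pairs and one discordant pair.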
Citation: Formiga, L. [et al.]. The TALP-UPC approach to system selection: ASIYA features and pairwise classification using random forests. In: Workshop on Statistical Machine Translation. "Proceedings of the Eighth Workshop on Statistical Machine Translation". Nice: 2013, p. 359-364.
Collections
- GPLN - Grup de Processament del Llenguatge Natural - Ponències/Comunicacions de congressos [192]
- Departament de Ciències de la Computació - Ponències/Comunicacions de congressos [1.250]
- VEU - Grup de Tractament de la Parla - Ponències/Comunicacions de congressos [437]
- Departament de Teoria del Senyal i Comunicacions - Ponències/Comunicacions de congressos [3.270]
Files | Description | Size | Format | View |
---|---|---|---|---|
W13-2244.pdf | | 174,1Kb | | Restricted access |