Browse by author "Carreras Pérez, Xavier"

A latent variable ranking model for content-based retrieval

Quattoni, Ariadna Julieta; Carreras Pérez, Xavier; Torralba, Antonio (Springer, 2012)
Conference proceedings text
Restricted access due to publisher policy

Since their introduction, ranking SVM models have become a powerful tool for training content-based retrieval systems. All we need for training a model are retrieval examples in the form of triplet constraints, i.e. examples ...

A proposal for wide-coverage Spanish named entity recognition

Arévalo, M.; Carreras Pérez, Xavier; Màrquez Villodre, Lluís; Martí Antonin, Maria Antònia; Padró, Lluís; Simon, Maria José (2002-04)
Research report
Open access

This paper presents a proposal for wide-coverage Named Entity Recognition for Spanish. First, a linguistic description of the typology of Named Entities is proposed. Following this definition, an architecture of sequential ...

A shortest-path method for arc-factored semantic role labeling

Lluis Martorell, Xavier; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2014)
Conference paper
Restricted access due to publisher policy

We introduce a Semantic Role Labeling (SRL) parser that finds semantic roles for a predicate together with the syntactic paths linking predicates and arguments. Our main contribution is to formulate SRL in terms of ...

An empirical study of semi-supervised structured conditional models for dependency parsing

Suzuki, Jun; Isozaki, Hideki; Carreras Pérez, Xavier; Collins, Michael (2009)
Conference proceedings text
Open access

This paper describes an empirical study of high-performance dependency parsers based on a semi-supervised learning approach. We describe an extension of semi-supervised structured conditional models (SS-SCMs) to the dependency ...

Boosting trees for anti-spam email filtering (Extended version)

Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2001-10)
Research report
Open access

In this work, a set of comparative experiments for the problem of automatically filtering unwanted electronic mail messages is performed on two public corpora: PU1 and LingSpam. Several variants of the AdaBoost algorithm ...

Exploiting diversity of margin-based classifiers

Romero Merino, Enrique; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2003-12)
Research report
Open access

An experimental comparison among Support Vector Machines, AdaBoost and a recently proposed model for maximizing the margin with Feed-forward Neural Networks has been made on a real-world classification problem, namely ...

Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks

Collins, Michael; Globerson, Amir; Koo, Terry; Carreras Pérez, Xavier; Bartlett, Peter (2008-08)
Article
Open access

Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of parameters in these models is therefore ...

Joint arc-factored parsing of syntactic and semantic dependencies

Lluis Martorell, Xavier; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2013-05)
Article
Restricted access due to publisher policy

In this paper we introduce a joint arc-factored model for syntactic and semantic dependency parsing. The semantic role labeler predicts the full syntactic paths that connect predicates with their arguments. This process ...

Learning task-specific bilexical embeddings

Madhyastha, Pranava S.; Carreras Pérez, Xavier; Quattoni, Ariadna Julieta (2014)
Conference proceedings text
Open access

We present a method that learns bilexical operators over distributional representations of words and leverages supervised data for a linguistic relation. The learning algorithm exploits low-rank bilinear forms and induces ...

Margin maximization with feed-forward neural networks: a comparative study with support vector machines and AdaBoost

Romero Merino, Enrique; Màrquez Villodre, Lluís; Carreras Pérez, Xavier (2003-06)
Research report
Open access

Feed-forward Neural Networks (FNN) and Support Vector Machines (SVM) are two machine learning frameworks developed from very different starting points of view. In this work a new learning model for FNN is proposed such ...

Non-projective parsing for statistical machine translation

Carreras Pérez, Xavier; Collins, Michael (2009)
Conference proceedings text
Open access

We describe a novel approach for syntax-based statistical MT, which builds on a variant of tree adjoining grammar (TAG). Inspired by work in discriminative dependency parsing, the key idea in our approach is to allow highly ...

Projective dependency parsing with perceptron

Carreras Pérez, Xavier; Surdeanu, Mihai; Màrquez Villodre, Lluís (2010)
Conference proceedings text
Open access

We describe an online learning dependency parser for the CoNLL-X Shared Task, based on the bottom-up projective algorithm of Eisner (2000). We experiment with a large feature set that models: the tokens involved in ...

Simple semi-supervised dependency parsing

Koo, Terry; Carreras Pérez, Xavier; Collins, Michael (2008)
Conference proceedings text
Open access

We present a simple and effective semi-supervised method for training dependency parsers. We focus on the problem of lexical representation, introducing features that incorporate word clusters derived from a large unannotated ...

Spectral learning in non-deterministic dependency parsing

Luque, Franco M.; Quattoni, Ariadna Julieta; Balle Pigem, Borja de; Carreras Pérez, Xavier (2012)
Conference proceedings text
Restricted access due to publisher policy

In this paper we study spectral learning methods for non-deterministic split head-automata grammars, a powerful hidden-state formalism for dependency parsing. We present a learning algorithm that, like other spectral ...

Spectral learning of weighted automata: a forward-backward perspective

Balle Pigem, Borja de; Carreras Pérez, Xavier; Luque, Franco M.; Quattoni, Ariadna Julieta (2013-10-07)
Article
Restricted access due to publisher policy

In recent years we have seen the development of efficient provably correct algorithms for learning Weighted Finite Automata (WFA). Most of these algorithms avoid the known hardness results by defining parameters beyond the ...

Spectral regularization for max-margin sequence tagging

Quattoni, Ariadna Julieta; Balle Pigem, Borja de; Carreras Pérez, Xavier; Globerson, Amir (2014)
Conference proceedings text
Open access

We frame max-margin learning of latent variable structured prediction models as a convex optimization problem, making use of scoring functions computed by input-output observable operator models. This learning problem can ...

Structured prediction models via the matrix-tree theorem

Koo, Terry; Globerson, Amir; Carreras Pérez, Xavier; Collins, Michael (2007)
Conference proceedings text
Open access

This paper provides an algorithmic framework for learning statistical models involving directed spanning trees, or equivalently non-projective dependency structures. We show how partition functions and marginals for directed ...

TAG, dynamic programming, and the perceptron for efficient, feature-rich parsing

Carreras Pérez, Xavier; Collins, Michael; Koo, Terry (Coling 2008 Organizing Committee, 2008)
Conference proceedings text
Open access

We describe a parsing approach that makes use of the perceptron algorithm, in conjunction with dynamic programming methods, to recover full constituent-based parse trees. The formalism allows a rich set of parse-tree ...

Translate first reorder later: leveraging monotonicity in semantic parsing

Cazzaro, Francesco; Locatelli, Davide; Quattoni, Ariadna Julieta; Carreras Pérez, Xavier (Association for Computational Linguistics, 2023)
Conference proceedings text
Open access

Prior work in semantic parsing has shown that conventional seq2seq models fail at compositional generalization tasks. This limitation led to a resurgence of methods that model alignments between sentences and their ...

Unsupervised spectral learning of finite-state transducers

Bailly, Raphaël; Carreras Pérez, Xavier; Quattoni, Ariadna Julieta (2012)
Conference proceedings text
Open access

Finite-State Transducers (FST) are a standard tool for modeling paired input-output sequences and are used in numerous applications, ranging from computational biology to natural language processing. Recently Balle et al. ...