LARCA és un grup de recerca de nivell internacional en temes d'aprenentatge i mineria de dades, anàlisi de dades i lingüística matemàtica. Típicament, abordem els problemes des de principis matemàtics sòlids, usant eines de modelització i tècniques de l'algorísmia, complexitat computacional, teoria d'autòmats, lògica, matemàtica discreta, estadística, i sistemes dinàmics. Estem iniciant
col·laboracions amb empreses i altres institucions per tal d'aplicar les nostres solucions al problemes del món real.

LARCA es un grupo de investigación de nivel internacional en temas de aprendizaje y minería de datos, análisis de datos y lingüística matemática. Típicamente, abordamos los problemas desde principios matemáticos sólidos, usando técnicas de modelización de la algoritmia, la complejidad computacional, teoría de autómatas, lógica, matemática discreta, estadística y sistemas dinámicos. Estamos iniciando colaboraciones con empresas e instituciones para la aplicación de nuestras soluciones a contextos reales.

LARCA is an international research group working on data mining, machine learning, data analysis, and mathematical linguistics. We typically approach problems from sound mathematical principles, using modelling tools and techniques from algorithmics, computational complexity, automata theory, logic, discrete mathematics, statistics, and dynamic systems. We are also starting partnerships with companies and other institutions to apply our solutions to real-world problems.

LARCA is an international research group working on data mining, machine learning, data analysis, and mathematical linguistics. We typically approach problems from sound mathematical principles, using modelling tools and techniques from algorithmics, computational complexity, automata theory, logic, discrete mathematics, statistics, and dynamic systems. We are also starting partnerships with companies and other institutions to apply our solutions to real-world problems.

Recent Submissions

  • Zipf's law of abbreviation as a language universal 

    Bentz, Chris; Ferrer Cancho, Ramon (University of Tübingen, 2016)
    Conference report
    Open Access
    Words that are used more frequently tend to be shorter. This statement is known as Zipf’s law of abbreviation. Here we perform the widest investigation of the presence of the law to date. In a sample of 1262 texts and 986 ...
  • Long-distance dependencies are not uniquely human 

    Ferrer Cancho, Ramon; Longa Martínez, Víctor Manuel; Lorenzo González, Guillermo (World Scientific Publishing, 2008)
    Conference report
    Restricted access - publisher's policy
    It is widely assumed that long-distance dependencies between elements are a unique feature of human language. Here we review recent evidence of long-distance correlations in sequences produced by non-human species and ...
  • Measuring investment opportunities under uncertainty 

    Castro Rabal, Jorge; Gabarró Vallès, Joaquim; Serna Iglesias, María José (Springer, 2019)
    Conference report
    Open Access
    In order to make sound economic decisions it is important to measure the possibilities offered by a market in relation to investments. Provided an investment scheme S = (r; R1, . . . , Rn), where r is a lower bound on the ...
  • Refining the imprecise meaning of non-determinism in the Web by strategic games 

    Castro Rabal, Jorge; Gabarró Vallès, Joaquim; Serna Iglesias, María José (Springer, 2019)
    Conference report
    Open Access
    Nowadays interactions with the World Wide Web are ubiquitous. Users interact through a number of steps consisting of site calls and handling results that can be automatized as orchestrations. Orchestration results have ...
  • Portfolio optimization in incomplete markets and price constraints determined by maximum entropy in the mean 

    Arratia Quesada, Argimiro Alejandro; Gzyl, Henryk (2019-12-12)
    Article
    Restricted access - publisher's policy
    A solution to a portfolio optimization problem is always conditioned by constraints on the initial capital and the price of the available market assets. If a risk neutral measure is known, then the price of each asset is ...
  • A stronger null hypothesis for crossing dependencies 

    Ferrer Cancho, Ramon (2014-12-04)
    Article
    Open Access
    The syntactic structure of a sentence can be modeled as a tree where vertices are words and edges indicate syntactic dependencies between words. It is well known that those edges normally do not cross when drawn over the ...
  • The challenges of statistical patterns of language: the case of Menzerath's law in genomes 

    Ferrer Cancho, Ramon; Forns Fradera, Núria; Hernández Fernández, Antonio; Bel Enguix, Gemma; Baixeries i Juvillà, Jaume (2013-01)
    Article
    Open Access
    The importance of statistical patterns of language has been debated over decades. Although Zipf's law is perhaps the most popular case, recently, Menzerath's law has begun to be involved. Menzerath's law manifests in ...
  • ¿Es el habla una señal crítica auto-organizada? 

    González Torre, Iván; Artime, Oriol; Hernández Fernández, Antonio; Luque Serrano, Bartolome (Universidad Nacional Autónoma de México, 2020-01-01)
    Article
    Restricted access - publisher's policy
    A lo largo del siglo XX, los estudios en lingüística cuantitativa han ido mostrando la aparición de leyes potenciales en las lenguas, primero en textos escritos y posteriormente en el habla. Son leyes que parecen ubicuas ...
  • A statistical model from information theory to explain Zipf's law of brevity 

    Hernández Fernández, Antonio; González Torre, Iván; Lacasa, Lucas; Kello, Christopher; Luque Serrano, Bartolome (Basque Center on Cognition, Brain and Language, 2019)
    Conference lecture
    Restricted access - publisher's policy
    Brevity and frequency are two crucial factors in the processes of statistical learning. The compression principle had already been used previously to explain the origin of Zipf’s law for the frequency of words. Here we use ...
  • Log-normal distribution in acoustic linguistic units 

    González Torre, Iván; Lacasa, Lucas; Kello, Christopher; Luque Serrano, Bartolome; Hernández Fernández, Antonio (Basque Center on Cognition, Brain and Language, 2019)
    Conference lecture
    Restricted access - publisher's policy
    In this work we verify with accuracy that acoustically transcribed durations of linguistic units at several scales (phonemes, words and Breath Groups) comply with log-normal distribution. To do this we have used a very ...
  • Linguistic laws in speech: the case of Catalan and Spanish 

    Hernández Fernández, Antonio; González Torre, Iván; Garrido, Juan Maria; Lacasa, Lucas (2019-11-26)
    Article
    Open Access
    In this work we consider Glissando Corpus—an oral corpus of Catalan and Spanish—and empirically analyze the presence of the four classical linguistic laws (Zipf’s law, Herdan’s law, Brevity law, and Menzerath–Altmann’s ...
  • On the physical origin of linguistic laws and lognormality in speech 

    González Torre, Iván; Luque Serrano, Bartolome; Lacasa, Lucas; Kello, Christopher; Hernández Fernández, Antonio (2019-08-21)
    Article
    Open Access
    Physical manifestations of linguistic units include sources of variability due to factors of speech production which are by definition excluded from counts of linguistic symbols. In this work, we examine whether linguistic ...

View more