Exploració per tema "Apache Spark"
Ara es mostren els items 1-9 de 9
-
Análisis, uso y desarrollo experimental de herramientas y tecnologías Open Source en Big Data
(Universitat Politècnica de Catalunya, 2017-09-12)
Treball Final de Grau
Accés obertIn this project, we pretend to analise, use and justify the use of Big Data nowadays in different areas like companies, research laboratories, etc., as well as different tools and Open Source technologies behind it that ... -
Big data analytics for obesity prediction
(Universitat Politècnica de Catalunya, 2018-04-26)
Projecte Final de Màster Oficial
Accés obert
Realitzat a/amb: Fundació EurecatFeature selection is an important technique to find the most relevant features. Apache Spark is a big data processing framework but unable to cope with approx. 0.74 million features in our Obesity dataset. However, we ... -
Discovering ship navigation patterns towards environment impact modeling
(Universitat Politècnica de Catalunya, 2017-04-28)
Projecte Final de Màster Oficial
Accés obert
Realitzat a/amb: Barcelona Supercomputing CenterShip positioning and maneuvering information is highly relevant to understand the levels of pollution on coastal cities and sea-life quality, containing latent patterns of vessels behavior, that are of utility on earth ... -
Enabling interpretation of the outcome of a human obesity prediction machine learning analysis from genomic data
(2018)
Report de recerca
Accés obertIn this brief paper, we address the medical problem of human obesity prediction from genomic data. Genomic datasets may contain a huge number of features and they often have to be analyzed within the realm of Big Data ... -
Inferring latent user attributes in streams on multimodal social data using spark
(Universitat Politècnica de Catalunya, 2015-04-29)
Projecte Final de Màster Oficial
Accés obertThe principal goal of this work can be expressed in two simple words Apache Spark; basically this framework help a developer to deal with big data. Our scope is to understand how Spark operates and use it to deal with Big ... -
Large-scale retrospective event detection from tweets through a DBSCAN-like algorithm in Apache Spark
(Universitat Politècnica de Catalunya, 2016-10-26)
Projecte Final de Màster Oficial
Accés obert
Realitzat a/amb: Barcelona Supercomputing CenterMessages posted on Location-Based Social Networks (LBSNs) such as Twitter have been reporting everything from daily life stories to the latest local and global news. Monitoring and analyzing this rich and continuous ... -
Machine learning approaches for the prediction of polygenic obesity using genome-wide genotyping data
(Universitat Politècnica de Catalunya / Universitat de Barcelona, 2020-06)
Projecte Final de Màster Oficial
Accés restringit per decisió de l'autorIn the last decade, Polygenic Risk Scores (PRSs) have been widely used to identify individuals at high risk of being obese. In the present study, we propose to consider a variety of Machine Learning (ML) algorithms to ... -
Making kernel machines scalable combining matrix approximations and distributed computing
(Universitat Politècnica de Catalunya, 2018-04-08)
Projecte Final de Màster Oficial
Accés obertIn this work, kernelized binary support vector machines are implemented based on stochastic gradient descent. The Scala library can be used both on a single computing node and on a Spark cluster. Additional tools for ... -
Scaling DBSCAN-like algorithms for event detection systems in Twitter
(Springer, 2016-11-25)
Capítol de llibre
Accés obertThe increasing use of mobile social networks has lately transformed news media. Real-world events are nowadays reported in social networks much faster than in traditional channels. As a result, the autonomous detection of ...