Browsing by Subject "Big data"
Now showing items 1-20 of 259
-
A cost-based storage format selector for materialized results in big data frameworks
(2019-05-08)
Article
Open AccessModern big data frameworks (such as Hadoop and Spark) allow multiple users to do large-scale analysis simultaneously, by deploying data-intensive workflows (DIWs). These DIWs of different users share many common tasks (i.e, ... -
A general guide to applying machine learning to computer architecture
(2018)
Article
Open AccessThe resurgence of machine learning since the late 1990s has been enabled by significant advances in computing performance and the growth of big data. The ability of these algorithms to detect complex patterns in data which ... -
A methodology for Spark parameter tuning
(2017-05-19)
Article
Open AccessSpark has been established as an attractive platform for big data analysis, since it manages to hide most of the complexities related to parallelism, fault tolerance and cluster setting from developers. However, this comes ... -
A new reliability-based data-driven approach to simulation-based models
(Barcelona Supercomputing Center, 2017-05-04)
Conference report
Open AccessData Science has burst into simulation-based en-gineering sciences with an impressive impulse. However, data are never uncertainty-free and a suitable approach is needed to face data measurement errors and their intrinsic ... -
A New role of ontologies and advanced scientific visualization in big data analytics
(Barcelona Supercomputing Center, 2016-09-10)
Conference report
Open AccessAccessing and contextual semantic searching structured, semi-structured and unstructured information resources and their ontology based analysis in a uniform way across text-free Big Data query implementation is a main ... -
A programming model for hybrid workflows: combining task-based workflows and dataflows all-in-one
(Elsevier, 2020-12)
Article
Restricted access - publisher's policyIn the past years, e-Science applications have evolved from large-scale simulations executed in a single cluster to more complex workflows where these simulations are combined with High-Performance Data Analytics (HPDA). ... -
A Quick View on Current Techniques and Machine Learning Algorithms for Big Data Analytics
(Institute of Electrical and Electronics Engineers (IEEE), 2016)
Conference report
Open AccessBig-data is an excellent source of knowledge and information from our systems and clients, but dealing with such amount of data requires automation, and this brings us to data mining and machine leaming techniques. In ... -
A resilient and distributed near real-time traffic forecasting application for Fog computing environments
(Elsevier, 2018-10)
Article
Open AccessIn this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ... -
A resilient and distributed near real-time traffic forecasting application for Fog computing environments
(Elsevier, 2018-10-01)
Article
Open AccessIn this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ... -
A scalable synthetic traffic model of Graph500 for computer networks analysis
(2017-12-25)
Article
Open AccessThe Graph500 benchmark attempts to steer the design of High-Performance Computing systems to maximize the performance under memory-constricted application workloads. A realistic simulation of such benchmarks for architectural ... -
A software reference architecture for semantic-aware big data systems
(2016-06-13)
Article
Open AccessContext: Big Data systems are a class of software systems that ingest, store, process and serve massive amounts of heterogeneous data, from multiple sources. Despite their undisputed impact in current society, their ... -
Acceleració d'algoritmes de clustering mitjançant targetes gràfiques
(Universitat Politècnica de Catalunya, 2017-06-29)
Bachelor thesis
Open AccessEn aquest projecte s'ha estudiat com es comporten diferents algoritmes de clustering amb diferents opotimitzacions de CUDA. Concretament s'han estudiat com funcionen l'algoritme k-means i k-centers amb les optimitzacions ... -
Accelerating Hash-Based Query Processing Operations on FPGAs by a Hash Table Caching Technique
(Springer International Publishing, 2017-04-29)
Conference lecture
Open AccessExtracting valuable information from the rapidly growing field of Big Data faces serious performance constraints, especially in the software-based database management systems (DBMS). In a query processing system, hash-based ... -
Aircraft conflict detection using data mining and statistical tools
(Universitat Politècnica de Catalunya, 2016-10-28)
Bachelor thesis
Open AccessNowadays and taking into account the different indicators and the forecast of the air traffic volume from Eurocontrol, it is accepted the idea that the traffic will be doubled by 2030, that is why it is so important ... -
Aligning textual and model-based process descriptions
(Elsevier, 2018-01-01)
Article
Open AccessProcess model descriptions are an ubiquitous source of information that exists in any organization. To reach different types of stakeholders, distinct descriptions are often kept, so that process understandability is boosted ... -
ALOJA: A benchmarking and predictive platform for big data performance analysis
(Springer, 2016)
Conference report
Open AccessThe main goals of the ALOJA research project from BSC-MSR, are to explore and automate the characterization of cost-effectivenessof Big Data deployments. The development of the project over its first year, has resulted in ... -
ALOJA: A framework for benchmarking and predictive analytics in Hadoop deployments
(Institute of Electrical and Electronics Engineers (IEEE), 2015-10)
Article
Open AccessThis article presents the ALOJA project and its analytics tools, which leverages machine learning to interpret Big Data benchmark performance data and tuning. ALOJA is part of a long-term collaboration between BSC and ... -
ALOJA: a systematic study of Hadoop deployment variables to enable automated characterization of cost-effectiveness
(Institute of Electrical and Electronics Engineers (IEEE), 2014)
Conference report
Open AccessThis article presents the ALOJA project, an initiative to produce mechanisms for an automated characterization of cost-effectiveness of Hadoop deployments and reports its initial results. ALOJA is the latest phase of a ... -
An analysis of Bicing mobility patterns using big data
(Universitat Politècnica de Catalunya, 2016-06-23)
Master thesis
Open AccessNowadays, technology advances really fast and so does the generation of data. Almost all electronic devices are constantly generating and sharing a huge amount of data through the World Wide Web. Moreover, recent policies ... -
An auction framework for DaaS in cloud computing
(Springer, 2018)
Conference report
Restricted access - publisher's policyData as a Service (DaaS) is the next emerging technology in cloud computing research. Small clouds operating as a group may exploit the DaaS efficiently to perform substantial amount of work. In this paper an auction ...