Universitat Politècnica de Catalunya

UPCommons. Global access to UPC knowledge

ALOJA: a systematic study of Hadoop deployment variables to enable automated characterization of cost-effectiveness

10.1109/BigData.2014.7004322
 
Cite as:
hdl:2117/28152

Poggi Mastrokalo, Nicolas
Carrera Pérez, David
Call, Aaron
Mendoza, Sergio
Becerra Fontal, Yolanda
Torres Viñals, Jordi
Ayguadé Parra, Eduard
Gagliardi, Fabrizio
Labarta Mancho, Jesús José
Reinauer, Rob
Vujic, Nikola
Green, Daron
Blakeley, Jose
Document type: Conference report
Defense date: 2014
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Rights access: Open Access
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
This article presents the ALOJA project, an initiative to produce mechanisms for the automated characterization of the cost-effectiveness of Hadoop deployments, and reports its initial results. ALOJA is the latest phase of a long-term collaborative engagement between BSC and Microsoft which, over the past 6 years, has explored a range of different aspects of computing systems, software technologies, and performance profiling. Although Hadoop has become the de facto platform for Big Data deployments over the last 5 years, little is still understood about how the different layers of software and hardware deployment options affect its performance. Early ALOJA results show that Hadoop's runtime performance, and therefore its price, is critically affected by relatively simple software and hardware configuration choices, e.g., number of mappers, compression, or volume configuration. Project ALOJA presents a vendor-neutral repository featuring over 5000 Hadoop runs, a test bed, and tools to evaluate the cost-effectiveness of different hardware, parameter tuning, and Cloud services for Hadoop. As few organizations have the time or performance profiling expertise, we expect our growing repository will help Hadoop customers meet their Big Data application needs. ALOJA seeks to provide both knowledge and an online service with which users can make better-informed configuration choices for their Hadoop compute infrastructure, whether on-premise or cloud-based. The initial version of ALOJA's Web application and sources are available at http://hadoop.bsc.es.
Citation: Poggi, N. [et al.]. ALOJA: a systematic study of Hadoop deployment variables to enable automated characterization of cost-effectiveness. In: IEEE International Conference on Big Data. "2014 IEEE International Conference on Big Data: 27-30 October 2014, Washington DC, USA: proceedings". Washington DC: Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 905-913.
URI: http://hdl.handle.net/2117/28152
DOI: 10.1109/BigData.2014.7004322
ISBN: 978-1-4799-5665-4
Publisher version: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7004322
Collections
  • Computer Sciences - Ponències/Comunicacions de congressos [501]
  • CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [782]
  • Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.848]

Files: BSC-MSR_ALOJA.pdf (ALOJA paper, 705,7Kb, PDF)
