Show simple item record

dc.contributor.authorFischer e Silva, Renan
dc.contributor.authorCarpenter, Paul Matthew
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2017-01-27T10:35:22Z
dc.date.available2017-01-27T10:35:22Z
dc.date.issued2016-12-26
dc.identifier.citationFischer e Silva, Renan; Carpenter, Paul M. Controlling Network Latency in Mixed Hadoop Clusters: Do We Need Active Queue Management?. A: 41st Conference on Local Computer Networks (LCN), 7-10 Nov. 2016. "Local Computer Networks (LCN), 2016 IEEE 41st Conference on". IEEE, 2016, p. 415-423.
dc.identifier.isbn978-1-5090-2054-6
dc.identifier.urihttp://hdl.handle.net/2117/100187
dc.description.abstractWith the advent of big data, data center applications are processing vast amounts of unstructured and semi-structured data, in parallel on large clusters, across hundreds to thousands of nodes. The highest performance for these batch big data workloads is achieved using expensive network equipment with large buffers, which accommodate bursts in network traffic and allocate bandwidth fairly even when the network is congested. Throughput-sensitive big data applications are, however, often executed in the same data center as latency-sensitive workloads. For both workloads to be supported well, the network must provide both maximum throughput and low latency. Progress has been made in this direction, as modern network switches support Active Queue Management (AQM) and Explicit Congestion Notifications (ECN), both mechanisms to control the level of queue occupancy, reducing the total network latency. This paper is the first study of the effect of Active Queue Management on both throughput and latency, in the context of Hadoop and the MapReduce programming model. We give a quantitative comparison of four different approaches for controlling buffer occupancy and latency: RED and CoDel, both standalone and also combined with ECN and DCTCP network protocol, and identify the AQM configurations that maintain Hadoop execution time gains from larger buffers within 5%, while reducing network packet latency caused by bufferbloat by up to 85%. Finally, we provide recommendations to administrators of Hadoop clusters as to how to improve latency without degrading the throughput of batch big data workloads.
dc.description.sponsorshipThe research leading to these results has received funding from the European Unions Seventh Framework Programme (FP7/2007–2013) under grant agreement number 610456 (Euroserver). The research was also supported by the Ministry of Economy and Competitiveness of Spain under the contracts TIN2012-34557 and TIN2015-65316-P, Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272), HiPEAC-3 Network of Excellence (ICT- 287759), and the Severo Ochoa Program (SEV-2011-00067) of the Spanish Government.
dc.format.extent8 p.
dc.language.isoeng
dc.publisherIEEE
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshData centers
dc.subject.lcshBig data
dc.subject.lcshHadoop
dc.subject.otherData Center
dc.subject.otherMapReduce
dc.subject.otherHadoop
dc.subject.otherDCTCP
dc.subject.otherECN
dc.titleControlling Network Latency in Mixed Hadoop Clusters: Do We Need Active Queue Management?
dc.typeConference report
dc.subject.lemacMacrodades
dc.subject.lemacProgramació
dc.identifier.doi10.1109/LCN.2016.70
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/document/7796816/
dc.rights.accessOpen Access
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/
local.citation.contributor41st Conference on Local Computer Networks (LCN), 7-10 Nov. 2016
local.citation.publicationNameLocal Computer Networks (LCN), 2016 IEEE 41st Conference on
local.citation.startingPage415
local.citation.endingPage423


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record