Mostra el registre d'ítem simple

dc.contributor.authorCasas, Marc
dc.contributor.authorBronevetsky, Greg
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2017-08-29T08:52:46Z
dc.date.available2019-09-01T00:25:42Z
dc.date.issued2017-09
dc.identifier.citationCasas, M.; Bronevetsky, G. Prediction of the impact of network switch utilization on application performance via active measurement. "Parallel Computing", Setembre 2017, vol. 67, p. 38-56.
dc.identifier.issn0167-8191
dc.identifier.urihttp://hdl.handle.net/2117/107213
dc.description.abstractAlthough one of the key characteristics of High Performance Computing (HPC) infrastructures are their fast interconnecting networks, the increasingly large computational capacity of HPC nodes and the subsequent growth of data exchanges between them constitute a potential performance bottleneck. To achieve high performance in parallel executions despite network limitations, application developers require tools to measure their codes’ network utilization and to correlate the network’s communication capacity with the performance of their applications. This paper presents a new methodology to measure and understand network behavior. The approach is based in two different techniques that inject extra network communication. The first technique aims to measure the fraction of the network that is utilized by a software component (an application or an individual task) to determine the existence and severity of network contention. The second injects large amounts of network traffic to study how applications behave on less capable or fully utilized networks. The measurements obtained by these techniques are combined to predict the performance slowdown suffered by a particular software component when it shares the network with others. Predictions are obtained by considering several training sets that use raw data from the two measurement techniques. The sensitivity of the training set size is evaluated by considering 12 different scenarios. Our results find the optimum training set size to be around 200 training points. When optimal data sets are used, the proposed methodology provides predictions with an average error of 9.6% considering 36 scenarios.
dc.description.sponsorshipWith the support of the Secretary for Universities and Research of the Ministry of Economy and Knowledge of the Government of Catalonia and the Cofund programme of the Marie Curie Actions of the 7th R&D Framework Programme of the European Union (Expedient 2013BP_B00243). The research leading to these results has received funding from the European Research Council under the European Union’s 7th FP (FP/2007-2013) /ERC GA n. 321253. Work partially supported by the Spanish Ministry of Science and Innovation (TIN2012-34557)
dc.format.extent19 p.
dc.language.isoeng
dc.publisherElsevier
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria electrònica
dc.subject.lcshHigh performance computing
dc.subject.lcshNetwork computers
dc.subject.lcshSoftware architecture
dc.subject.otherPerformance modeling
dc.subject.otherResource sharing
dc.subject.otherMeasurement techniques
dc.titlePrediction of the impact of network switch utilization on application performance via active measurement
dc.typeArticle
dc.subject.lemacSupercomputadors
dc.subject.lemacProgramari
dc.identifier.doi10.1016/j.parco.2017.06.005
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://www.sciencedirect.com/science/article/pii/S016781911730090X
dc.rights.accessOpen Access
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/321253/EU/Riding on Moore's Law/ROMOL
local.citation.publicationNameParallel Computing
local.citation.volume67
local.citation.startingPage38
local.citation.endingPage56


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple