Efficient deep ensembles by averaging neural networks in parameter space

Norris Mitchell, Philip

memoria.pdf (1.376 MB)

Tutor / director: Agudo Martínez, Antonio; Ruiz Ovejero, Adrià

Document type: Master thesis

Date: 2021-10

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, its reproduction, distribution, public communication or transformation without the authorization of the rights holder is prohibited.

Abstract

Although deep ensembles provide large accuracy boosts relative to individual models, their use is not widespread in environments in which computational resources are limited, as deep ensembles require storing M models and performing M forward passes at prediction time. We propose a novel, computationally efficient alternative, which we name permAVG. Although the members of a deep ensemble cannot simply be averaged in parameter space, as the models find distinct, perhaps distant, local optima, permAVG exploits the symmetries of the loss landscape by learning permutations such that all M models can be permuted into the same local optimum and can thereafter safely be averaged.
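
The permutation symmetry the abstract relies on can be illustrated with a toy example. The sketch below is not the thesis's permAVG method: it assumes a one-hidden-layer tanh network, and it replaces the learned permutations with a brute-force search over hidden-unit orderings, which is only feasible for a handful of units. All function names (`forward`, `permute`, `align`, `average`) are hypothetical and chosen for illustration.

```python
import itertools
import math
import random


def forward(model, x):
    """One-hidden-layer tanh MLP: y = sum_j v[j] * tanh(W[j] . x + b[j])."""
    W, b, v = model
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bj)
              for row, bj in zip(W, b)]
    return sum(vj * hj for vj, hj in zip(v, hidden))


def permute(model, perm):
    """Reorder the hidden units; the function computed is unchanged."""
    W, b, v = model
    return ([W[j] for j in perm], [b[j] for j in perm], [v[j] for j in perm])


def flatten(model):
    """Concatenate all parameters into one flat list."""
    W, b, v = model
    return [w for row in W for w in row] + list(b) + list(v)


def distance(m1, m2):
    """Euclidean distance between two models in parameter space."""
    return math.sqrt(sum((p - q) ** 2
                         for p, q in zip(flatten(m1), flatten(m2))))


def align(reference, model):
    """Assumption: brute-force stand-in for a learned permutation. Picks the
    hidden-unit ordering of `model` closest to `reference`."""
    n_hidden = len(model[1])
    best = min(itertools.permutations(range(n_hidden)),
               key=lambda p: distance(reference, permute(model, p)))
    return permute(model, best)


def average(models):
    """Element-wise mean of the parameters of several same-shaped models."""
    W0, _, _ = models[0]
    n_hidden, n_in = len(W0), len(W0[0])
    mean = [sum(vals) / len(vals)
            for vals in zip(*(flatten(m) for m in models))]
    W = [mean[j * n_in:(j + 1) * n_in] for j in range(n_hidden)]
    b = mean[n_hidden * n_in:n_hidden * n_in + n_hidden]
    v = mean[n_hidden * n_in + n_hidden:]
    return (W, b, v)


rng = random.Random(0)
n_in, n_hidden = 3, 4
model_a = ([[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_hidden)],
           [rng.gauss(0, 1) for _ in range(n_hidden)],
           [rng.gauss(0, 1) for _ in range(n_hidden)])
# model_b computes the same function as model_a with shuffled hidden units,
# so the two sit at different points of parameter space.
model_b = permute(model_a, [2, 0, 3, 1])
x = [0.5, -1.0, 2.0]

naive = average([model_a, model_b])                    # mixes unrelated units
aligned = average([model_a, align(model_a, model_b)])  # permute, then average

print("model_a:", forward(model_a, x))
print("model_b:", forward(model_b, x))
print("naive average:", forward(naive, x))
print("aligned average:", forward(aligned, x))
```

In this constructed case the two models are exact permutations of each other, so the aligned average reproduces the original function while the naive average generally does not. Independently trained ensemble members are only approximately alignable, which is presumably why the permutations have to be learned rather than enumerated.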

Subjects: Artificial intelligence

Degree: MÀSTER UNIVERSITARI EN MATEMÀTICA AVANÇADA I ENGINYERIA MATEMÀTICA (Pla 2010)

URI: http://hdl.handle.net/2117/356936

Collections

Màsters oficials - Master of Science in Advanced Mathematics and Mathematical Engineering (MAMME) [295]

UPCommons. The UPC's open knowledge portal
