Show simple item record

Ubiquitous Distributed Deep Reinforcement Learning at the Edge: Analyzing Byzantine Agents in Discrete Action Spaces

Jorge Peña Queralta; Li Qingqing; Tomi Westerlund; Wenshuai Zhao

dc.contributor.author: Jorge Peña Queralta
dc.contributor.author: Li Qingqing
dc.contributor.author: Tomi Westerlund
dc.contributor.author: Wenshuai Zhao
dc.date.accessioned: 2022-10-28T13:28:55Z
dc.date.available: 2022-10-28T13:28:55Z
dc.identifier.issn: 1877-0509
dc.identifier.uri: https://www.utupub.fi/handle/10024/165479
dc.description.abstract: The integration of edge computing in next-generation mobile networks is bringing low-latency, high-bandwidth ubiquitous connectivity to a myriad of cyber-physical systems. This will further boost the increasing intelligence being embedded at the edge in various types of autonomous systems, where collaborative machine learning can play a significant role. This paper discusses some of the challenges that arise in multi-agent distributed deep reinforcement learning in the presence of Byzantine or malfunctioning agents. As the simulation-to-reality gap is bridged, the probability of malfunctions or errors must be taken into account. We show how wrong discrete actions can significantly affect the collaborative learning effort. In particular, we analyze the effect of having a fraction of agents that perform the wrong action with a given probability. We study the ability of the system to converge towards a common working policy through the collaborative learning process, as a function of the number of experiences from each agent aggregated for each policy update and the fraction of wrong actions from agents experiencing malfunctions. Our experiments are carried out in a simulation environment using the Atari testbed for the discrete action spaces, and advantage actor-critic (A2C) for the distributed multi-agent training.
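The malfunction model described in the abstract — a fraction of agents that takes a wrong discrete action with a given probability — can be sketched as follows. This is a minimal illustration, not the paper's code; the function and parameter names (`byzantine_action`, `p_wrong`, `make_agent_error_probs`) are hypothetical.

```python
import random

def byzantine_action(policy_action: int, n_actions: int, p_wrong: float,
                     rng: random.Random) -> int:
    """Return the policy's chosen action, or, with probability p_wrong,
    a uniformly sampled *different* action (a Byzantine/malfunctioning agent)."""
    if rng.random() < p_wrong:
        # Sample uniformly among the other n_actions - 1 discrete actions.
        wrong = rng.randrange(n_actions - 1)
        return wrong if wrong < policy_action else wrong + 1
    return policy_action

def make_agent_error_probs(n_agents: int, byzantine_fraction: float,
                           p_wrong: float) -> list[float]:
    """Assign the error probability p_wrong to a given fraction of the
    agents; the remaining agents always follow the policy (p = 0)."""
    n_byz = round(n_agents * byzantine_fraction)
    return [p_wrong if i < n_byz else 0.0 for i in range(n_agents)]
```

In a distributed A2C setup of the kind the abstract describes, each worker would pass its policy's sampled action through `byzantine_action` before stepping its environment, so the shared policy update aggregates experiences from both correct and faulty agents.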
dc.language.iso: en
dc.relation.ispartofseries: Procedia Computer Science
dc.title: Ubiquitous Distributed Deep Reinforcement Learning at the Edge: Analyzing Byzantine Agents in Discrete Action Spaces
dc.identifier.urn: URN:NBN:fi-fe2021042827257
dc.relation.volume: 177
dc.contributor.organization: fi=PÄÄT Sulautettu elektroniikka | en=PÄÄT Embedded Electronics
dc.contributor.organization-code: 2606802
dc.converis.publication-id: 51395531
dc.converis.url: https://research.utu.fi/converis/portal/Publication/51395531
dc.format.pagerange: 324–329
dc.identifier.jour-issn: 1877-0509
dc.okm.affiliatedauthor: Li, Qingqing
dc.okm.affiliatedauthor: Peña Queralta, Jorge
dc.okm.affiliatedauthor: Westerlund, Tomi
dc.okm.discipline: 113 Computer and information sciences (en_GB)
dc.okm.discipline: 213 Electronic, automation and communications engineering, electronics (en_GB)
dc.okm.discipline: 113 Tietojenkäsittely ja informaatiotieteet (fi_FI)
dc.okm.discipline: 213 Sähkö-, automaatio- ja tietoliikennetekniikka, elektroniikka (fi_FI)
dc.okm.internationalcopublication: not an international co-publication
dc.okm.internationality: International publication
dc.okm.type: Conference proceedings article
dc.publisher.country: Netherlands (en_GB)
dc.publisher.country: Alankomaat (fi_FI)
dc.publisher.country-code: NL
dc.relation.conference: International Conference on Emerging Ubiquitous Systems and Pervasive Networks
dc.relation.doi: 10.1016/j.procs.2020.10.043
dc.relation.ispartofjournal: Procedia Computer Science
dc.title.book: The 11th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN 2020) / The 10th International Conference on Current and Future Trends of Information and Communication Technologies in Healthcare (ICTH 2020) / Affiliated Workshops
dc.year.issued: 2020


Files in this item


This item appears in the following collection(s)
