News

Abstract: This article considers a distributed stochastic nonconvex optimization problem, where the nodes in a network cooperatively minimize a sum of L -smooth local cost functions with sparse ...
In this article, we propose a distributional policy-gradient method based on distributional reinforcement learning (RL) and policy gradient. Conventional RL algorithms typically estimate the ...