文献
J-GLOBAL ID:202202244015950013
整理番号:22A0428225
ネットワーク上の分散強化学習における完全非同期ポリシー評価【JST・京大機械翻訳】
Fully asynchronous policy evaluation in distributed reinforcement learning over networks
著者 (5件):
Sha Xingyu
(Department of Automation and Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China)
,
Zhang Jiaqi
(Department of Automation and Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China)
,
You Keyou
(Department of Automation and Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China)
,
Zhang Kaiqing
(Laboratory for Information & Decision Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, USA)
,
Basar Tamer
(Department of Electrical and Computer Engineering and Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA)
資料名:
Automatica
(Automatica)
巻:
136
ページ:
Null
発行年:
2022年
JST資料番号:
B0208A
ISSN:
0005-1098
資料種別:
逐次刊行物 (A)
記事区分:
原著論文
発行国:
オランダ (NLD)
言語:
英語 (EN)