文献
J-GLOBAL ID:201702218583906580
整理番号:17A1385769
多目的強化学習のための報酬変化におけるポリシー不変性【Powered by NICT】
Policy invariance under reward transformations for multi-objective reinforcement learning
著者 (6件):
Mannion Patrick
(Department of Computer Science & Applied Physics, Galway-Mayo Institute of Technology, Dublin Road, Galway, Ireland)
,
Mannion Patrick
(Discipline of Information Technology, National University of Ireland Galway, University Road, Galway, Ireland)
,
Devlin Sam
(Department of Computer Science, University of York, Deramore Lane, York, UK)
,
Mason Karl
(Discipline of Information Technology, National University of Ireland Galway, University Road, Galway, Ireland)
,
Duggan Jim
(Discipline of Information Technology, National University of Ireland Galway, University Road, Galway, Ireland)
,
Howley Enda
(Discipline of Information Technology, National University of Ireland Galway, University Road, Galway, Ireland)
資料名:
Neurocomputing
(Neurocomputing)
巻:
263
ページ:
60-73
発行年:
2017年
JST資料番号:
W0360A
ISSN:
0925-2312
資料種別:
逐次刊行物 (A)
記事区分:
原著論文
発行国:
オランダ (NLD)
言語:
英語 (EN)