Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State-values in Policies

ISHIHARA SEIJI; IGARASHI HARUKAZU

Art

J-GLOBAL ID：200902284034284253 Reference number：08A0281862

Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State-values in Policies

方策こう配法を用いた行動学習:環境のダイナミクスと行動知識とを分離した方策の表現

Publisher site Copy service {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=08A0281862&COPY=1") }}
Access JDreamⅢ for advanced search and analysis. {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=08A0281862&from=J-GLOBAL&jstjournalNo=X0905A") }}

Author (2)： ,
Material：
Volume： 35th Page： 230-235 Publication year： 2008
JST Material Number： X0905A Document type： Proceedings
Article type：原著論文 Country of issue： Japan (JPN) Language： JAPANESE (JA)

, , , , , , , , ,
, , ,

Artificial intelligence

, , , , , , ,

Return to Previous Page