Art
J-GLOBAL ID:200902284034284253   Reference number:08A0281862

Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State-values in Policies

方策こう配法を用いた行動学習:環境のダイナミクスと行動知識とを分離した方策の表現
Author (2):
Material:
Volume: 35th  Page: 230-235  Publication year: 2008 
JST Material Number: X0905A  Document type: Proceedings
Article type: 原著論文  Country of issue: Japan (JPN)  Language: JAPANESE (JA)
Thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

Semi thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

JST classification (1):
JST classification
Category name(code) classified by JST.
Artificial intelligence 

Return to Previous Page