reference/7364277

Predicate	Object
contentType	Journal Article | Research Support, Non-U.S. Gov't
endingPage	593
issn	0899-7667 1530-888X
issueIdentifier	3
pageRange	563-593
publicationName	Neural Computation
startingPage	563
bibliographicCitation	Ma Y, Zhao T, Hatano K, Sugiyama M. An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions. Neural Comput. 2016 Mar;28(3):563–93. doi: 10.1162/neco_a_00808. PMID: 26735742.
creator	http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_0d53a6eb1bf63715c4a067bd06ff055e http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_042479631793d8c5ef504e483715e318 http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_f8f0c583ff2645283866e0916f45c57c http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_7cba7eeb29d40d0e4036210aae8c2380 http://rdf.ncbi.nlm.nih.gov/pubchem/author/ORCID_0000-0001-6658-6743
date	201603
identifier	https://doi.org/10.1162/neco_a_00808 https://pubmed.ncbi.nlm.nih.gov/26735742
isPartOf	https://portal.issn.org/resource/ISSN/0899-7667 https://portal.issn.org/resource/ISSN/1530-888X http://rdf.ncbi.nlm.nih.gov/pubchem/journal/20143
language	English
source	https://www.crossref.org/ https://pubmed.ncbi.nlm.nih.gov/
title	An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
