http://rdf.ncbi.nlm.nih.gov/pubchem/reference/7364277

Outgoing Links

Predicate Object
contentType Journal Article|Research Support, Non-U.S. Gov't
endingPage 593
issn 0899-7667
1530-888X
issueIdentifier 3
pageRange 563-593
publicationName Neural Computation
startingPage 563
bibliographicCitation Ma Y, Zhao T, Hatano K, Sugiyama M. An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions. Neural Comput. 2016 Mar;28(3):563–93. doi: 10.1162/neco_a_00808. PMID: 26735742.
creator http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_0d53a6eb1bf63715c4a067bd06ff055e
http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_042479631793d8c5ef504e483715e318
http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_f8f0c583ff2645283866e0916f45c57c
http://rdf.ncbi.nlm.nih.gov/pubchem/author/MD5_7cba7eeb29d40d0e4036210aae8c2380
http://rdf.ncbi.nlm.nih.gov/pubchem/author/ORCID_0000-0001-6658-6743
date 201603
identifier https://doi.org/10.1162/neco_a_00808
https://pubmed.ncbi.nlm.nih.gov/26735742
isPartOf https://portal.issn.org/resource/ISSN/0899-7667
https://portal.issn.org/resource/ISSN/1530-888X
http://rdf.ncbi.nlm.nih.gov/pubchem/journal/20143
language English
source https://www.crossref.org/
https://pubmed.ncbi.nlm.nih.gov/
title An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions

Showing number of triples: 1 to 24 of 24.