http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-108921298-B

Outgoing Links

Predicate Object
classificationCPCInventive http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06N3-063
http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-23
classificationIPCInventive http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06N3-063
http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06N20-00
http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06K9-62
filingDate 2018-06-12^^<http://www.w3.org/2001/XMLSchema#date>
grantDate 2022-04-19^^<http://www.w3.org/2001/XMLSchema#date>
publicationDate 2022-04-19^^<http://www.w3.org/2001/XMLSchema#date>
publicationNumber CN-108921298-B
titleOfInvention Reinforcement learning for multi-agent communication and decision-making
abstract The invention discloses a reinforcement-learning method for multi-agent communication and decision-making, comprising: extracting the corresponding state features through a neural network from the observed state information of each agent; performing soft allocation and clustering to obtain clustered communication information; distributing the clustered communication information to each agent, each agent aggregating its own state features with the clustered communication information it receives; and making action decisions through a fully connected neural network inside the agent. The method clusters the state information of each agent and shares it with the other agents, thereby improving each agent's decision-making.
priorityDate 2018-06-12^^<http://www.w3.org/2001/XMLSchema#date>
type http://data.epo.org/linked-data/def/patent/Publication
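
The pipeline summarized in the abstract (feature extraction, soft allocation and clustering, distribution, aggregation, action decision) can be illustrated with a minimal Python sketch. It is not the patented implementation. The record does not specify any of the following, so all of it is assumed for illustration: the function names, a single dense layer shared by all agents as the encoder, softmax over negative squared distances to fixed cluster centers as the soft allocation, an assignment-weighted mix of cluster messages as the distribution step, concatenation as the aggregation, a single dense layer as the fully connected decision network, and random untrained weights.

```python
import math
import random

def linear(x, w, b):
    """Dense layer for one input vector: returns W x + b."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]

def relu(v):
    return [max(0.0, a) for a in v]

def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    e = [math.exp(a - m) for a in v]
    s = sum(e)
    return [a / s for a in e]

def init_layer(rng, n_out, n_in):
    """Random, untrained weights (assumption: no training is shown here)."""
    w = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return w, [0.0] * n_out

def soft_cluster(features, centers):
    """Soft-assign every agent's features to every cluster center.

    Assumed form: softmax over negative squared distances. Each cluster
    message is the assignment-weighted mean of the agents' features.
    """
    assign = []
    for f in features:
        neg_d = [-sum((a - c) ** 2 for a, c in zip(f, ctr)) for ctr in centers]
        assign.append(softmax(neg_d))
    dim = len(features[0])
    messages = []
    for k in range(len(centers)):
        total = max(sum(a[k] for a in assign), 1e-12)
        messages.append(
            [sum(a[k] * f[d] for a, f in zip(assign, features)) / total
             for d in range(dim)]
        )
    return assign, messages

def decide(observations, encoder, centers, policy):
    """One decision step for all agents; returns actions and soft assignments."""
    # 1. Extract state features from each agent's observation.
    features = [relu(linear(o, *encoder)) for o in observations]
    # 2. Soft allocation and clustering into communication messages.
    assign, messages = soft_cluster(features, centers)
    actions = []
    for f, a in zip(features, assign):
        # 3. Distribute: each agent receives a mix of the cluster messages,
        #    weighted by its own soft assignment (assumed scheme).
        received = [sum(a[k] * messages[k][d] for k in range(len(messages)))
                    for d in range(len(f))]
        # 4. Aggregate by concatenation, then pick the highest-scoring action.
        logits = linear(f + received, *policy)
        actions.append(max(range(len(logits)), key=logits.__getitem__))
    return actions, assign

rng = random.Random(0)
n_agents, obs_dim, feat_dim, n_clusters, n_actions = 4, 3, 5, 2, 3
encoder = init_layer(rng, feat_dim, obs_dim)
centers = [[rng.uniform(0, 1) for _ in range(feat_dim)] for _ in range(n_clusters)]
policy = init_layer(rng, n_actions, 2 * feat_dim)
observations = [[rng.uniform(-1, 1) for _ in range(obs_dim)] for _ in range(n_agents)]

actions, assign = decide(observations, encoder, centers, policy)
print(actions)  # one action index per agent
```

In a trained system the encoder, cluster centers, and decision network would presumably be learned from reward signals; here they are fixed random values so that the data flow between the steps is easy to follow.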

Incoming Links

Predicate Subject
isDiscussedBy http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID2950
http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID419481012
