patent/CN-108921298-B

http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-108921298-B

Outgoing Links

Predicate	Object
classificationCPCInventive	http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06N3-063 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-23
classificationIPCInventive	http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06N3-063 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06N20-00 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06K9-62
filingDate	2018-06-12^^<http://www.w3.org/2001/XMLSchema#date>
grantDate	2022-04-19^^<http://www.w3.org/2001/XMLSchema#date>
publicationDate	2022-04-19^^<http://www.w3.org/2001/XMLSchema#date>
publicationNumber	CN-108921298-B
titleOfInvention	Reinforcement learning for multi-agent communication and decision-making
abstract	The invention discloses a reinforcement learning method for multi-agent communication and decision-making, comprising: extracting state features for each agent through a neural network from that agent's observed state information; performing soft assignment and clustering on the extracted state features to obtain clustered communication information; and distributing the clustered communication information to each agent, where each agent aggregates its own state features with the received clustered communication information and makes an action decision through a fully connected neural network inside the agent. The method clusters the state information of the agents and shares it among them, thereby improving each agent's decision-making.
priorityDate	2018-06-12^^<http://www.w3.org/2001/XMLSchema#date>
type	http://data.epo.org/linked-data/def/patent/Publication
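
The pipeline described in the abstract (feature extraction, soft clustering, distribution, aggregation, action decision) can be illustrated with a minimal sketch. This is not the patented implementation. The record does not give the network sizes, the clustering formula, or the aggregation rule, so everything below is an assumption made for illustration: the one-layer encoder, the softmax over negative squared distances used as soft assignment, concatenation as the aggregation step, decision weights shared across agents, and every name (`soft_cluster`, `decide`, `params`).

```python
import math
import random


def linear(x, W, b):
    """Return W @ x + b using plain Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    e = [math.exp(a - m) for a in v]
    s = sum(e)
    return [a / s for a in e]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def soft_cluster(features, centers):
    """Soft-assign every agent feature to every center (assumed rule:
    softmax of negative squared distance), then return the weighted mean
    per cluster, flattened into one message vector."""
    k, d = len(centers), len(centers[0])
    sums = [[0.0] * d for _ in range(k)]
    weights = [0.0] * k
    for f in features:
        logits = [-sum((fi - ci) ** 2 for fi, ci in zip(f, c)) for c in centers]
        assign = softmax(logits)
        for j in range(k):
            weights[j] += assign[j]
            for t in range(d):
                sums[j][t] += assign[j] * f[t]
    message = []
    for j in range(k):
        denom = max(weights[j], 1e-12)  # guard against an empty cluster
        message.extend(s / denom for s in sums[j])
    return message


def decide(observations, params):
    """Run one communication round and return (actions, message)."""
    # 1. Extract state features from each agent's observation.
    features = [relu(linear(o, params["W_enc"], params["b_enc"]))
                for o in observations]
    # 2. Cluster all features into one shared message.
    message = soft_cluster(features, params["centers"])
    actions = []
    for f in features:
        # 3. Each agent aggregates its own features with the message
        #    (assumed here to be concatenation).
        joint = f + message
        # 4. A fully connected layer scores the actions; pick the best one.
        probs = softmax(linear(joint, params["W_out"], params["b_out"]))
        actions.append(max(range(len(probs)), key=probs.__getitem__))
    return actions, message


# Hypothetical sizes, chosen only to make the sketch runnable.
OBS, FEAT, K, ACTIONS, AGENTS = 4, 3, 2, 5, 3
rng = random.Random(0)
params = {
    "W_enc": rand_matrix(rng, FEAT, OBS),
    "b_enc": [0.0] * FEAT,
    "centers": rand_matrix(rng, K, FEAT),
    "W_out": rand_matrix(rng, ACTIONS, FEAT + K * FEAT),
    "b_out": [0.0] * ACTIONS,
}
observations = [[rng.uniform(-1, 1) for _ in range(OBS)] for _ in range(AGENTS)]
actions, message = decide(observations, params)
print(actions, len(message))
```

The weights here are random and untrained, so the chosen actions carry no meaning. The sketch only shows how data flows: every agent receives the same message of length `K * FEAT`, regardless of how many agents contribute to it.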

Incoming Links

Predicate	Subject
isDiscussedBy	http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID2950 http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID419481012
