Title: | Concurrent Learning of Control in Multi-agent Sequential Decision Tasks |
Author: | Banerjee, B. |
Keywords: | Multiagent systems, Models, Learning, Artificial intelligence, MARL (multi-agent reinforcement learning), Dec-POMDP (decentralized partially observable Markov decision process) |
Abstract: | The overall objective of this project was to develop multi-agent reinforcement learning (MARL) approaches for intelligent agents to autonomously learn distributed control policies in decentralized partially observable Markov decision processes (Dec-POMDPs), without prior knowledge of the model parameters. |
Report type: | Scientific and technical report |