Title: Concurrent Learning of Control in Multi-agent Sequential Decision Tasks.
Author: Banerjee, B.
Keywords: Multiagent systems, Models, Learning, Artificial intelligence, MARL (multi-agent reinforcement learning), Dec-POMDP (decentralized partially observable Markov decision process)
Abstract: The overall objective of this project was to develop multi-agent reinforcement learning (MARL) approaches for intelligent agents to autonomously learn distributed control policies in decentralized partially observable Markov decision processes (Dec-POMDPs), without prior knowledge of the model parameters.
Report type: Scientific and technical report