Multi-Agent Reinforcement Learning: Maximum Entropy, Rationality and Equilibrium
温颖，University College London
2019-10-11 14:00:00 ~ 2019-10-11 15:30:00
Room 1319, Software Expert Building
Weinan Zhang, Assistant Professor, John Hopcroft Center for Computer Science
In the multi-agent reinforcement learning (MARL), one common assumption is that all agents behave rationally during their interactions. For example, we assume agents' behaviors will converge to Nash equilibrium. However, in practice, it is hard to guarantee that all agents have same level of sophistication in terms of their abilities in understanding and learning from each other. Therefore, the effectiveness of MARL models decreases, especially when the opponents act irrationally.
Ying Wen is a final year Ph.D. Student in the Department of Computer Science, University College London, under the supervision of Prof. Jun Wang. His research interests are in the fields of reinforcement learning, multi-agent learning, and applications in real-world scenarios. More precisely, he is interested in the modeling dynamics and rationality in multi-agent reinforcement learning. He has published several papers in top-tier international conferences, such as ICLR, IJCAI, AAMAS, ICDM. He has over four years' experience working with tech companies to ground intelligent machine learning solutions in real business problems.