id author title date pages extension mime words sentences flesch summary cache txt cord-123804-cgvikrwm Liu, Changliu A Microscopic Epidemic Model and Pandemic Prediction Using Multi-Agent Reinforcement Learning 2020-04-27 .txt text/plain 5939 483 72 We first formulate a microscopic multi-agent epidemic model where every agent can choose its activity level that affects the spread of the disease. Then by minimizing agents' cost functions, we solve for the optimal decisions for individual agents in the framework of game theory and multi-agent reinforcement learning. Nash Equilibrium According to (7), the expect cost for an infected agent only depends on its own action. However, as shown in the right plot in Fig. 7(b) , the agents learned to flatten the curve faster than in case 1, mainly because healthy agents are more cautious (converge faster to low activity levels) when they start to consider cumulative costs. Note that when m k is high, the healthy agents still prefer low activity level, though the optimal actions for infected agents are low. ./cache/cord-123804-cgvikrwm.txt ./txt/cord-123804-cgvikrwm.txt