Swarm Reinforcement Learning Algorithms Based on a Particle Swarm Optimization. By Hitoshi Iima and Yasuaki Kuroe. IEEE 2008.
What they did
- Q-learning for individual learning, and then exchange information with other agents, taking on the best Q-learning results (need to be evaluated in some way). They developed three updating methods.