Maximum diffusion reinforcement learning - Nature Machine Intelligence