Has masac been trained in the multiagent particle envs? Can it converge?
Has masac been trained in the multiagent particle envs? Can it converge?