Minimizing Negative Side Effects in Cooperative Multi-Agent Systems using Distributed Coordination
Minimizing Negative Side Effects in Cooperative Multi-Agent Systems using Distributed Coordination
Blog Article
Autonomous agents operating in real-world environments frequently encounter undesirable outcomes or negative side effects (NSEs) when working collaboratively alongside other agents.Even when agents can execute their primary task optimally when operating in isolation, their training Birth Flower Name Necklace may not account for potential negative interactions that arise in the presence of other agents.We frame the challenge of minimizing NSEs as a lexicographic decentralized Markov decision process in which we assume independence of rewards and transitions with respect to the primary assigned tasks, but recognize that addressing negative side effects creates a form of dependence among the agents.
We present a lexicographic Q-learning approach to mitigate the NSEs using human feedback models while maintaining near-optimality with respect to the assigned tasks---up to some given slack.Our empirical evaluation across two domains Chargers demonstrates that our collaborative approach effectively mitigates NSEs, outperforming non-collaborative methods.