Concordia Contest NeurIPS 2024 — LLM-based Multi-Agent Cooperation

How do language agents behave when cooperation is possible, but not guaranteed? This project explored that question through the Concordia Contest at NeurIPS 2024, where LLM-based agents were evaluated in mixed-motive social dilemma environments requiring skills such as negotiation, reciprocity, promise-keeping, reputation management, compromise, partner choice, and sanctioning. My work centred on the design and evaluation of agents that could cooperate not only in familiar scenarios, but also with unseen co-players in held-out environments. Through the Concordia framework, this research examined how language agents respond under cooperation pressure, and contributed to the co-authored paper Evaluating generalization capabilities of LLM-based agents in mixed-motive scenarios using concordia.”

Links: