🍫 Introducing CocoaBench, an evaluation framework for evaluating general agents’ compositional cognitive abilities.