| Oct 15, 2025 | Check out BigCodeArena, a human-in-the-loop platform for evaluating code through execution. |
| Sep 20, 2025 | Guru, our exploration of cross-domain RL for LLM reasoning, is accepted to NeurIPS 2025! |
| Jun 20, 2025 | Check out Guru: how cross-domain RL supercharges LLM reasoning. |
| Oct 10, 2024 | We pre-release Decentralized Arena for automated, scalable, and transparent LLM evaluation. |
| Sep 20, 2024 | DRPO is accepted to the main conference of EMNLP 2024! |
| Jul 10, 2024 | LLM Reasoners is accepted to COLM 2024! |
| Feb 28, 2024 | 💫 We release StarCoder 2, a family of open LLMs for code. |
| Jan 16, 2024 | RepoBench is accepted to ICLR 2024! |
| Nov 18, 2023 | 🥳 ToolkenGPT receives the Best Paper Award at SoCal NLP 2023! |
| Sep 22, 2023 | ToolkenGPT is accepted to NeurIPS 2023 as an oral presentation! |