Meta released Agents Research Environment, an agentic testing environment, and Gaia2, a new benchmark that measures agents' real-world adaptability.