OpenAI news openai.com

via techmeme 3 months ago
OpenAI: OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700+ questions, finding GPT-5.2 is its strongest model  —  We introduce FrontierScience, a new benchmark that evaluates AI capabilities for expert-level scientific reasoning across physics, chemistry, and biology.

No comments yet…

Login to comment.