OpenAI o1

OpenAI o1 is a reasoning-focused model series first released on September 12, 2024, as the first product of OpenAI's "Strawberry" project. It is trained with large-scale reinforcement learning to reason before answering and is named outside the GPT series. Its core innovation is test-time compute scaling: the model spends seconds to minutes reasoning internally through a hidden chain of thought, trying different approaches and correcting its own mistakes before it answers. This yields large gains in STEM domains: on a qualifying exam for the International Mathematical Olympiad (IMO), o1 scored 83% versus GPT-4o's 13%. At launch, however, it lacked GPT-4o's browsing, file upload, and image processing capabilities. The release included o1-preview and a smaller o1-mini, with API access initially restricted to tier-5 developers. OpenAI researchers framed the release as a paradigm shift from scaling training compute to scaling inference-time reasoning, and Greg Brockman described it as "System 2 thinking" unlocked by chain-of-thought reasoning trained with reinforcement learning.
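To make the idea of trading inference-time compute for accuracy concrete, here is a toy Python sketch. It is not how o1 works: o1's hidden chain of thought is not public, so the sketch swaps in a much simpler technique, repeated sampling with majority voting. The solver, its 40% per-attempt accuracy, and all function names are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def noisy_solver(rng, correct_answer, p_correct, n_wrong_options=4):
    """Hypothetical stand-in for one sampled reasoning attempt.

    Returns the correct answer with probability p_correct; otherwise
    returns one of several distinct wrong answers, chosen uniformly.
    """
    if rng.random() < p_correct:
        return correct_answer
    return correct_answer + rng.randint(1, n_wrong_options)


def answer_with_budget(rng, correct_answer, p_correct, n_samples):
    """Spend more inference compute: sample n_samples attempts and vote."""
    votes = Counter(
        noisy_solver(rng, correct_answer, p_correct) for _ in range(n_samples)
    )
    # Ties go to the answer that was sampled first.
    return votes.most_common(1)[0][0]


def accuracy(n_samples, p_correct=0.4, trials=2000, seed=0):
    """Fraction of simulated problems answered correctly at a given budget."""
    rng = random.Random(seed)
    hits = sum(
        answer_with_budget(rng, 42, p_correct, n_samples) == 42
        for _ in range(trials)
    )
    return hits / trials


if __name__ == "__main__":
    for n in (1, 5, 25):
        print(f"samples per problem: {n:2d}  accuracy: {accuracy(n):.3f}")
```

In this simulation, accuracy rises as the per-problem sampling budget grows even though the underlying solver never changes. That is the general shape of the test-time compute argument, under the simplifying assumption that wrong answers scatter while correct ones agree.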

AI-generated — may contain errors, please verify.

Mentioned in 6 articles

Coverage

Moonshot AI Releases Visual Reasoning Model k1, Outperforming OpenAI o1 in STEM Benchmarks | Z News

30,000-Word Transcript: A Google DeepMind Researcher on Deconstructing OpenAI o1 and the LLM+RL Paradigm | Z Talk

Yichao Ji, Peak: A Small Step Toward Replicating OpenAI o1 — Steiner Open-Source Model Progress Report | Z Talk

Moonshot AI Founder Zhilin Yang's Latest Take: Deep Reflections on OpenAI's o1 Paradigm Shift | Z Talk