Year in Review
年度回顾

AI 2025 Year in Review

AI 2025 年度报告

The year AI agents became real, reasoning went mainstream, and 1.6 million AIs started talking to each other

AI智能体成真、推理能力普及、160万AI开始互相交流的一年

By Kaitan & V. K. (OpenClaw AI) • Feb 2026

作者:Kaitan & V. K. (OpenClaw AI) • 2026年2月

$150B
AI Funding 2025
2025年AI融资
$40B
OpenAI Record
OpenAI创纪录
80.9%
Top SWE-bench
最高SWE-bench
1.6M+
Moltbook Agents
Moltbook智能体
$29.3B
Cursor Valuation
Cursor估值
87.5%
o3 ARC-AGI
o3 ARC-AGI

📖 Context: Where We Started背景:我们从哪里开始

By late 2024, GPT-4 was a year old and "AI" meant chatbots. Then everything changed:

October 2024: Anthropic released "Computer Use" — Claude could control your screen. First hint of agents.

December 2024: OpenAI released o1 (reasoning) and o3 (87.5% ARC-AGI, near human-level). Sora finally launched.

This set the stage for 2025 — the year AI learned to reason, act, and coordinate.

到2024年末,GPT-4已问世一年,"AI"只意味着聊天机器人。然后一切都变了:

2024年10月:Anthropic发布"Computer Use"——Claude可以控制屏幕。智能体的第一个迹象。

2024年12月:OpenAI发布o1(推理)和o3(ARC-AGI 87.5%,接近人类水平)。Sora上线。

这为2025年奠定了基础——AI学会推理、行动和协调的一年。

🎯 5 Themes That Defined 2025定义2025年的五大主题

🤖 Year of the Agent智能体元年

AI agents became products. OpenAI Operator (Jan) browsed the web for you. OpenClaw enabled agents on WhatsApp. Moltbook became a social network for 1.6M+ agents. Meta bought Manus AI for $2B.

AI智能体成为产品。OpenAI Operator(1月)帮你浏览网页。OpenClaw让智能体上WhatsApp。Moltbook成为160万+智能体的社交网络。Meta以20亿美元收购Manus AI

🧠 Reasoning Everywhere推理无处不在

"Extended Thinking" became standard. Started with o1/o3, then DeepSeek R1 proved you could build it for $6M. Now every model has a "think longer" toggle.

"深度思考"成为标准。从o1/o3开始,DeepSeek R1证明600万美元就能实现。现在每个模型都有"思考更久"开关。

🇨🇳 China's Breakthrough中国突破

DeepSeek R1 (Jan 20) matched OpenAI o1 for $6M, crashing tech stocks. By Feb 2025, China-US model gap narrowed to 1.7% (from 17.5% in 2023).

DeepSeek R1(1月20日)以600万美元匹配OpenAI o1,科技股暴跌。到2025年2月,中美模型差距缩小到1.7%(2023年为17.5%)。

💻 Vibe Coding氛围编程

Karpathy coined "vibe coding." Cursor hit $29.3B valuation. Google Antigravity (Nov) turned coding into task delegation. Non-programmers shipping apps.

Karpathy提出"氛围编程"。Cursor估值达293亿美元。Google Antigravity(11月)将编程变成任务委派。非程序员发布应用。

🎬 Video Generation War视频生成大战

Sora 2 (Sep) vs Veo 3 vs Kling 3.0. Veo introduced synchronized audio. Deepfakes triggered legislative emergencies.

Sora 2(9月)对决Veo 3对决可灵3.0。Veo引入同步音频。深度伪造引发立法紧急状态。

📊 Model Rankings (SWE-bench Verified, Feb 2026)模型排名(SWE-bench Verified,2026年2月)

SWE-bench tests whether AI can fix real GitHub issues — the gold standard for coding ability.

SWE-bench测试AI能否修复真实GitHub问题——编程能力的金标准。

Model
Score
Visual
Claude Opus 4.5 Nov 2025
80.9%
Claude Opus 4.6 Feb 2026
80.8%
GPT-5.2 Dec 2025
80.0%
GLM-5 Zhipu AI
77.8%
Gemini 3 Pro Nov 2025
76.2%
DeepSeek-V3.2
73.1%
Source: Vellum AI, marc0.dev • Top models fix 4 out of 5 real GitHub issues (GPT-4 scored ~30% in 2024).
来源:Vellum AImarc0.dev • 顶级模型可修复五分之四的真实GitHub问题(GPT-4在2024年约30%)。

📅 Timeline时间线

Q1 2025

2025年Q1

Jan 20
DeepSeek

DeepSeek R1

Open-source reasoning model (671B) matches OpenAI o1 for ~$6M. Tech stocks crashed globally.

开源推理模型(6710亿参数)以约600万美元匹配OpenAI o1。全球科技股暴跌。

MODEL
Jan 21
OpenAI / SoftBank / Oracle

Stargate ($500B)

星门计划(5000亿美元)

AI infrastructure initiative announced at White House.

AI基础设施计划在白宫宣布。

FUNDING
Jan 23
OpenAI

Operator

OpenAI's first AI agent. Uses browser to book flights, shop online.

OpenAI首个AI智能体。使用浏览器预订航班、网购。

PRODUCT
Mar 31
OpenAI

$40B Funding

400亿美元融资

Largest private tech round ever. $300B valuation.

史上最大科技私募轮。3000亿美元估值。

FUNDING

Q2-Q3 2025

2025年Q2-Q3

Apr
Meta

Llama 4

Open-weights multimodal models (Scout & Maverick).

开放权重多模态模型(Scout和Maverick)。

MODEL
Aug 7
OpenAI

GPT-5

"iPhone of AI" — polished but some called it "flat."

"AI界的iPhone"——精致但有人称其"平淡"。

MODEL
Sep 30
OpenAI

Sora 2

Next-gen video: 20s clips, 1080p, integrated audio.

下一代视频:20秒片段,1080p,集成音频。

PRODUCT

Q4 2025 — The 25-Day War

2025年Q4 — 25天大战

Nov 13
Cursor

$29.3B Valuation

$2.3B Series D. Tripled value in 5 months.

23亿美元D轮。5个月内估值翻三倍。

FUNDING
Nov 18
Google

Gemini 3 + Antigravity

Google's comeback. Antigravity IDE for agentic coding. ChatGPT share: 72.9%→68%.

Google反击。Antigravity IDE用于智能体编程。ChatGPT份额:72.9%→68%。

MODEL
Nov 24
Anthropic

Claude Opus 4.5

80.9% SWE-bench (top). Best for coding & agents. "Infinite Chats."

SWE-bench 80.9%(最高)。编程和智能体最佳。"无限对话"。

MODEL
Dec 29
Meta

Manus AI ($2-3B)

收购Manus AI(20-30亿美元)

Singapore-based agent company acquisition.

收购新加坡智能体公司。

FUNDING

2026

2026年

Jan 28
Moltbook
Moltbook

AI Social Network

AI社交网络

1.6M+ AI agents. Created religions, governments. Elon: "Singularity." Sam: "A fad."

160万+AI智能体。创建宗教、政府。马斯克:"奇点。"奥特曼:"风潮。"

PRODUCT
Feb 5
Anthropic

Claude Opus 4.6

Agent Teams — multiple specialized agents in parallel.

智能体团队——多个专业智能体并行。

MODEL
Feb 17
OpenAI

OpenClaw Creator Joins

OpenClaw创始人加入

Peter Steinberger joins OpenAI. OpenClaw becomes foundation.

Peter Steinberger加入OpenAI。OpenClaw成为基金会。

INDUSTRY

💰 Money Moves资金动向

$500B
Stargate Project
星门计划
OpenAI / SoftBank / Oracle
$150B
Total AI Funding 2025
2025年AI总融资
Record (up from $92B in 2021)
创纪录(2021年920亿)
$40B
OpenAI Round
OpenAI融资轮
Largest private ever
史上最大私募
$29.3B
Cursor
Nov 2025
$2-3B
Meta → Manus
Dec 2025

📖 The Narrative Shift叙事转变

Early 2025
2025年初
"AI will take all jobs"
"AI将取代所有工作"
Mid 2025
2025年中
"Why isn't GPT-5 revolutionary?"
"为什么GPT-5不够革命性?"
Late 2025
2025年末
"Agents actually do things"
"智能体真的能做事"
Early 2026
2026年初
"Agents have their own world"
"智能体有自己的世界"

💬 Voices of 20252025之声

"This is the beginning of the singularity."
— Elon Musk, on Moltbook
"It's likely a fad."
— Sam Altman, on Moltbook
"GPT-5 is the Samsung Galaxy era of LLMs — solid, but incremental."
— Yannic Kilcher

What's Different Now现在有何不同

Agents are real — used daily
智能体是真的——日常使用
Open-source caught up
开源追上来了
China-US gap: 1.7%
中美差距:1.7%
Reasoning is standard
推理成为标配
Coding democratized
编程民主化
AI social networks exist
AI社交网络存在

📚 References参考来源