DeepSeek spent only $294,000 to train its R1 model. Is that for real? What is OpenAI supposed to do?

Crocodile · Posted by **Crocodile (OP)** » Sep 18, 2025 10:48

BEIJING (Reuters) - Chinese AI developer DeepSeek said it spent $294,000 on training its R1 model, much lower than figures reported for U.S. rivals, in a paper that is likely to reignite debate over Beijing's place in the race to develop artificial intelligence.

https://finance.yahoo.com/news/chinas-d ... 14733.html

LiSheQiang · Posted by **LiSheQiang** » Sep 18, 2025 11:08

Brag away, keep bragging, lol

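Crocodile · Posted by **Crocodile (OP)** » Sep 18, 2025 11:11

LiSheQiang wrote: Sep 18, 2025 11:08
Brag away, keep bragging, lol

Even if the real figure were 100 times higher, that is still only $29.4 million, a small fraction of what OpenAI spends.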

牛河梁

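Crocodile wrote: Sep 18, 2025 11:11
Even if the real figure were 100 times higher, that is still only $29.4 million, a small fraction of what OpenAI spends.

The market only cares whether its sales amount to even a small fraction of OpenAI's.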

manba · Posted by **manba** » Sep 18, 2025 11:15

Brag too many times and nobody believes you anymore.

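Crocodile · Posted by **Crocodile (OP)** » Sep 18, 2025 11:18

牛河梁 wrote: Sep 18, 2025 11:15
The market only cares whether its sales amount to even a small fraction of OpenAI's.

It is partially banned in the U.S. market, but plenty of people in China still use it, right?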

goodegg

DeepSeek R1 was just published in Nature yesterday.
The old geezers upthread think they understand AI better than Nature's reviewers.

Crocodile · Posted by **Crocodile (OP)** » Sep 18, 2025 11:24

goodegg wrote: Sep 18, 2025 11:19
DeepSeek R1 was just published in Nature yesterday.
The old geezers upthread think they understand AI better than Nature's reviewers.

https://www.nature.com/articles/s41586-025-09422-z
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
Abstract
General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs)1,2 and chain-of-thought (CoT) prompting3, have achieved considerable success on foundational reasoning tasks. However, this success is heavily contingent on extensive human-annotated demonstrations and the capabilities of models are still insufficient for more complex problems. Here we show that the reasoning abilities of LLMs can be incentivized through pure reinforcement learning (RL), obviating the need for human-labelled reasoning trajectories. The proposed RL framework facilitates the emergent development of advanced reasoning patterns, such as self-reflection, verification and dynamic strategy adaptation. Consequently, the trained model achieves superior performance on verifiable tasks such as mathematics, coding competitions and STEM fields, surpassing its counterparts trained through conventional supervised learning on human demonstrations. Moreover, the emergent reasoning patterns exhibited by these large-scale models can be systematically used to guide and enhance the reasoning capabilities of smaller models.

牛河梁

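goodegg wrote: Sep 18, 2025 11:19
DeepSeek R1 was just published in Nature yesterday.
The old geezers upthread think they understand AI better than Nature's reviewers.

Only useless stuff gets (open-sourced and) churned out as filler. (牛河梁 @ 买买提)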

牛河梁

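Crocodile wrote: Sep 18, 2025 11:18
It is partially banned in the U.S. market, but plenty of people in China still use it, right?

Genuine question: how many?

Genuine question: the U.S. banned it?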

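Crocodile · Posted by **Crocodile (OP)** » Sep 18, 2025 11:26

牛河梁 wrote: Sep 18, 2025 11:25
Genuine question: how many?

Genuine question: the U.S. banned it?

Its use is prohibited in U.S. government agencies and state universities.

As for China, I don't know.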

牛河梁

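Crocodile wrote: Sep 18, 2025 11:26
Its use is prohibited in government agencies and state universities.

The Chinese government doesn't use, or outright bans, OpenAI and the like too, right? And the Chinese government has more users than the U.S. government, right?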

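Mountainlion · Posted by **Mountainlion** » Sep 18, 2025 11:37

A paper like this should be read mainly from a science angle.
As for how much was spent, nobody but DS knows, and nobody cares.

goodegg wrote: Sep 18, 2025 11:19
DeepSeek R1 was just published in Nature yesterday.
The old geezers upthread think they understand AI better than Nature's reviewers.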

勇敢的小猫咪

牛河梁 wrote: Sep 18, 2025 11:24
Only useless stuff gets (open-sourced and) churned out as filler. (牛河梁 @ 买买提)

Huh? Android and Linux are shaking in their boots…

新未名空间