How long until the Humanity's Last Exam benchmark gets saturated? (90%+)
AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren
What the suspense feels like
This market is fucking delusional
o3-mini and o3-mini-high are rolling out shortly in ChatGPT
It's the end of January....
Grok (xAi) developers response to Anthropic and OpenAIs reaction to DeepSeek
Anthropic CEO says blocking AI chips to China is of existential importance after DeepSeeks release in new blog post.
Anduril's founder gives his take on DeepSeek
DeepSeek is the first ever LLM to have as much google searches as ChatGPT does, indicating that the new model could be the first direct competitor to OpenAI.
Sama on DeepSeek
[Suggestion] Moratorium on all deepseek related posts
Nasdaq 100 futures crash -1,100 POINTS as pre-market selling accelerates on worries of DeepSeek dethroning US Tech.
Massive wave of chinese propaganda
Can we give Google some credit for currently having the only model in the top 5 without "thinking"? Their full thinking model is probably going to be #1 once its finished testing.
What is making you stay with Claude after DeepSeek-R1's release?
o1 can no longer count number of r's in strawberry while legacy gpt-4 can
Hilarious simple DeepSeek-R1 prompt demonstrates how human its thinking is
Blir stresset av utviklerbransjen
Rumors of industry panic caused by DeepSeek
Humanity's Last Exam dataset is out!
What DeepSeek just did is insane. You can now do complex o1 level reasoning CHEAPER than what a regular ChatGPT-4o prompt costs.
PirateSoftware drinks water
Claude just referred to me as, "The human" ... Odd response ... Kind of creeped me out. This is from 3.5 Sonnet. My custom instructions are, "Show your work in all responses with tags>'