issues/25-04-16-ainews-qwq-32b-claims-to-match-deepseek-r1-671b #96
Replies: 2 comments 1 reply
- The date of this item should be 2025-03-06.
  - This might help...
Alibaba Qwen released QwQ-32B, a 32-billion-parameter reasoning model trained with a novel two-stage reinforcement learning approach: first scaling RL on math and coding tasks using accuracy verifiers and code-execution servers, then applying RL for general capabilities such as instruction following and alignment. The model aims to compete with much larger MoE models like DeepSeek-R1. Meanwhile, OpenAI rolled out GPT-4.5 to Plus users to mixed feedback: "GPT-4.5 is unusable for coding" was a notable user critique, while others praised its reasoning gains from scaled-up pretraining and noted improved inference costs.
https://news.smol.ai/issues/25-04-16-ainews-qwq-32b-claims-to-match-deepseek-r1-671b
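
The two-stage recipe described above lends itself to a compact sketch. The snippet below is an illustrative outline only, under assumed names (`math_verifier`, `code_verifier`, `rl_step`, `reward_model` are all hypothetical); it is not Qwen's actual training code, and the policy update is left as a placeholder for whatever RL algorithm is used in practice.

```python
# Hypothetical sketch of a two-stage RL loop: stage 1 uses outcome verifiers
# (exact-answer checks for math, test execution for code), stage 2 switches
# to a general reward model for instruction following and alignment.
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Sample:
    prompt: str
    completion: str


def math_verifier(sample: Sample, reference_answer: str) -> float:
    # Reward 1.0 only if the completion ends with the reference answer.
    return 1.0 if sample.completion.strip().endswith(reference_answer) else 0.0


def code_verifier(sample: Sample, tests: Callable[[str], bool]) -> float:
    # Reward 1.0 only if the generated code passes the supplied tests;
    # a real setup would run these in a sandboxed code-execution server.
    return 1.0 if tests(sample.completion) else 0.0


def rl_step(policy, samples: List[Sample], rewards: List[float]) -> None:
    # Placeholder for the policy-gradient update (e.g. PPO/GRPO in practice).
    ...


def train_two_stage(policy, math_data, code_data, general_data, reward_model):
    # Stage 1: RL on verifiable domains only (math and coding).
    for prompt, answer in math_data:
        s = Sample(prompt, policy(prompt))
        rl_step(policy, [s], [math_verifier(s, answer)])
    for prompt, tests in code_data:
        s = Sample(prompt, policy(prompt))
        rl_step(policy, [s], [code_verifier(s, tests)])
    # Stage 2: RL for general capabilities, scored by a reward model.
    for prompt in general_data:
        s = Sample(prompt, policy(prompt))
        rl_step(policy, [s], [reward_model(s)])
```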