On March 6,family Archives Alibaba released and open-sourced its new reasoning model, QwQ-32B, featuring 32 billion parameters. Despite being significantly smaller than DeepSeek-R1, which has 6,710 billion parameters (with 3.7 billion active), QwQ-32B matches its performance in various benchmarks. QwQ-32B excelled in math and coding tests, outperforming OpenAI’s o1-mini and distilled versions of DeepSeek-R1. It also scored higher than DeepSeek-R1 in some evaluations like LiveBench and IFEval. The model leverages reinforcement learning and integrates agent capabilities for critical thinking and adaptive reasoning. Notably, QwQ-32B requires much less computational power, making it deployable on consumer-grade hardware. This release aligns with Alibaba’s AI strategy, which includes significant investments in cloud and AI infrastructure. Following the release, Alibaba’s US stock rose 8.61% to $141.03, with Hong Kong shares up over 7%.[Jiemian, in Chinese]
Related Articles
2025-06-26 00:48
1338 views
A Supposedly Fun Thing I’ll Never Make Money from Again by Jenn Shapland
A Supposedly Fun Thing I’ll Never Make Money from AgainBy Jenn ShaplandJanuary 28, 2020Procrastinati
Read More
2025-06-26 00:34
1208 views
The Paris Review Crossword by Adrienne Raphel
The Paris Review CrosswordBy Adrienne RaphelMarch 11, 2020Department of Tomfooleryphoto by Wil540art
Read More