Tencent’s tech team has optimized DeepSeek’s open-source DeepEP communication framework, boosting its performance across different network environments, according to the Chinese AI startup. Testing showed a 100% improvement on RoCE networks and a 30% gain on InfiniBand (IB), offering more efficient solutions for AI model training. On GitHub, DeepSeek acknowledged that the Chinese tech giant’s contribution had led to a “huge speedup.” DeepEP is a communication library tailored for mixture-of-experts (MoE) models and expert parallelism (EP), supporting high-throughput, low-latency GPU kernels and low-precision computing, including FP8. Tencent’s Starlink Networking team identified two main bottlenecks: underutilized dual-port NIC bandwidth and CPU control latency. Targeted optimizations addressing both produced the gains on RoCE and IB. The enhanced framework is now fully open-source and has been successfully deployed in training Tencent’s Hunyuan large model, demonstrating strong versatility in environments built on Tencent’s Starlink and H20 servers, Chinese tech media outlet iThome reported. [iThome, in Chinese]