tencent

Tag

Cards List
#tencent

@0xSero: Just added 2 new model compressions: Hy3-FP8 & NVFP4 I recommend trying this model it's very strong and fits on 256gb o…

X AI KOLs Following · yesterday Cached

0xSero has released new FP8 and NVFP4 quantized versions of the Tencent Hy3-preview model, enabling it to run on 256GB VRAM with full context.

0 favorites 0 likes
#tencent

@WinForKakei: Let me use Tencent as an example. Tencent's 2025 capex is even lower than guidance. As clearly stated in last year's earnings call, this is because they couldn't buy NVIDIA GPUs (due to AI chip supply constraints) and were unwilling to buy domestic chips. Of course, they compromised this year and have started ordering Kunlun chips. Actually, Pony Ma is not as Zen or content with being a latecomer as people say...

X AI KOLs Following · yesterday

The article discusses Tencent's AI capex constraints due to NVIDIA chip shortages and its recent shift to using Kunlun chips, analyzing the company's valuation and strategic positioning in the AI landscape.

0 favorites 0 likes
#tencent

UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification

arXiv cs.CL · 3d ago Cached

UniPrefill is a new prefill acceleration framework proposed in a research paper that enables block-wise dynamic sparsification for universal long-context processing in LLMs. It integrates with vLLM to achieve up to 2.1x speedup in Time-To-First-Token across various model architectures.

0 favorites 0 likes
#tencent

HY-3 PREVIEW

Reddit r/LocalLLaMA · 2026-04-23 Cached

Tencent releases Hy3-preview, a 295B-parameter MoE model with 21B active parameters that excels in STEM reasoning, instruction following, coding and agent tasks.

0 favorites 0 likes
#tencent

Tencent, Alibaba in Talks to Invest in DeepSeek at $20 Billion-Plus Valuation

Reddit r/LocalLLaMA · 2026-04-22

Tencent and Alibaba are reportedly in talks to invest in Chinese AI startup DeepSeek at a valuation exceeding $20 billion.

0 favorites 0 likes
#tencent

@junyao_gao62882: The ImageNet moment for style transfer!! We have released the full code (training/inference), dataset, models of MegaSt…

X AI KOLs Following · 2026-04-21

Tencent releases MegaStyle, a large-scale open-source style transfer model with full training/inference code, 1.4M dataset, and pre-trained models.

0 favorites 0 likes
← Back to home

Submit Feedback