large-language-model

#large-language-model

@elonmusk: Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supple…

X AI KOLs Following ↗ · 2026-05-25 Cached

Elon Musk announced that the Grok foundation model V9-Medium (1.5T parameters) has finished training with strong evaluations, and will be publicly released in 2-3 weeks after fine-tuning and reinforcement learning.

0 favorites 0 likes

#large-language-model

@KtAIFeed: Straight to the point, no fluff. The recently popular Qwen 3.6 (35B/43B) latest open-source 'uncensored' model on Hugging Face (over a million downloads per month) can run locally with just 6GB VRAM on a single GPU. It completely breaks the original model's moral preaching and safety restrictions—no censorship, it will answer whatever you ask...

X AI KOLs Timeline ↗ · 2026-05-25 Cached

Introduces the Qwen 3.6 (35B/43B) open-source uncensored model, removing official moral and safety restrictions. Requires only 6GB VRAM for local operation. Over a million downloads.

0 favorites 0 likes

#large-language-model

@percyliang: Not only do we want to train a good model, we want to know it'll be good before we even start training. About a month a…

X AI KOLs Following ↗ · 2026-05-24 Cached

The Marin team pre-registered a predicted loss of 2.252 for a 129B parameter MoE model training run, and the actual result landed at 2.234, demonstrating accurate loss prediction before training.

0 favorites 0 likes

#large-language-model

Macaron-A2UI: A Model for Generative UI in Personal Agents

Hugging Face Daily Papers ↗ · 2026-05-24 Cached

Presents Macaron-A2UI, a model for generative UI in personal agents that synthesizes dynamic interfaces with lightweight executable actions, moving beyond text-only chat. The paper introduces a large-scale corpus, the A2UI-Bench benchmark, and trains models up to 754B parameters using LoRA fine-tuning and reinforcement learning, achieving strong results.

0 favorites 0 likes

#large-language-model

DeepSeek makes the V4 Pro price discount permanent

Hacker News Top ↗ · 2026-05-22 Cached

DeepSeek has made the 75% discount on V4 Pro API pricing permanent, reducing input/output token costs significantly.

0 favorites 0 likes

#large-language-model

@lxfater: NetEase Youdao open-sourced ZiYue 4 model, within 27B parameters, SOTA in math and science. But what really interests me is its voice feature!! Cloning a voice is nothing new, ElevenLabs could do it long ago. But they all share a common flaw: cross-language accent. Take your Chinese voice and use it to speak Japanese — it has a Chinese accent, you can tell it's a foreigner struggling...

X AI KOLs Timeline ↗ · 2026-05-22 Cached

NetEase Youdao open-sourced the ZiYue 4 model with 27B parameters, achieving SOTA in math and science; its voice feature supports 3-second cross-language voice cloning across 14 languages with no accent issue, along with open-sourcing the all-scenario intelligent agent 'Longxia' (Lobster).

0 favorites 0 likes

#large-language-model

VBFDD-Agent for Electric Vehicle Battery Fault Detection and Diagnosis: Descriptive Text Modeling of Battery Digital Signals

arXiv cs.AI ↗ · 2026-05-22 Cached

This paper proposes VBFDD-Agent, a vehicle battery fault detection and diagnosis agent that uses descriptive text modeling of battery signals, large language models, and historical cases to generate interpretable diagnostic results and maintenance recommendations for electric vehicle batteries.

0 favorites 0 likes

#large-language-model

Re. what ever happened to Cohere’s Command-A series of models?

Reddit r/LocalLLaMA ↗ · 2026-05-20

Cohere launches Command A+, its first Mixture-of-Experts model, released under Apache 2.0 with efficient quantization for 1-2 GPU deployment, prioritizing practicality and open access for developers.

0 favorites 0 likes

#large-language-model

@ClementDelangue: Cohere is on such a great open-source trajectory lately. Beautiful Apache 2.0 model! https://huggingface.co/CohereLabs/…

X AI KOLs Following ↗ · 2026-05-20 Cached

Cohere has released Command A+, an open-source model with 25 billion active parameters and 218B total parameters under Apache 2.0, optimized for agentic, multilingual, and reasoning-heavy tasks.

0 favorites 0 likes

#large-language-model

CohereLabs/command-a-plus-05-2026-bf16 · Hugging Face

Reddit r/LocalLLaMA ↗ · 2026-05-20 Cached

Cohere releases Command A+, an open-source model with 25B active parameters (218B total) optimized for agentic, multilingual, and reasoning-heavy tasks, supporting vision inputs and 128K context under Apache 2.0.

0 favorites 0 likes

#large-language-model

@seclink: Information Gap: Meituan's Longma large model, supports 55 million tokens per day, free upon registration ...

X AI KOLs Following ↗ · 2026-05-20 Cached

Meituan has launched its Longma large model, offering 55 million free tokens daily. Register and get free access.

0 favorites 0 likes

#large-language-model

Gemini flash is expensive!

Reddit r/singularity ↗ · 2026-05-19

The new Gemini Flash model is expensive to use, suggesting it may be a large but fast model.

0 favorites 0 likes

#large-language-model

Gemini 3.5 confirmed by google deepmind employee

Reddit r/singularity ↗ · 2026-05-19

A Google DeepMind employee has confirmed the existence of Gemini 3.5, the next iteration of Google's AI model.

0 favorites 0 likes

#large-language-model

Qwen 3.7 Has been Spotted on the Qwen website

Reddit r/singularity ↗ · 2026-05-18

A new version of Qwen, Qwen 3.7, has been spotted on the official Qwen website, suggesting an upcoming release.

0 favorites 0 likes

#large-language-model

Streaming Speech-to-Text Translation with a SpeechLLM

arXiv cs.CL ↗ · 2026-05-15 Cached

Presents a SpeechLLM architecture for streaming speech-to-text translation that adaptively decides when to output tokens based on audio, achieving 1-2 second latency with quality close to non-streaming baselines.

0 favorites 0 likes

#large-language-model

Useful Memories Become Faulty When Continuously Updated by LLMs

arXiv cs.AI ↗ · 2026-05-14 Cached

This paper shows that continuously consolidating past experiences into textual memory using LLMs degrades memory utility over time, and that preserving raw episodic trajectories outperforms forced consolidation, with implications for robust agentic memory systems.

0 favorites 0 likes

#large-language-model

The Trillion-Parameter Dilemma: MiMo-V2.5-Pro went open-source (1.02T params). Is self-hosting worth it when the API costs $70 for 387M tokens?

Reddit r/LocalLLaMA ↗ · 2026-05-13

Xiaomi open-sourced MiMo-V2.5-Pro, a 1.02 trillion parameter MoE model, prompting a cost-benefit analysis of using its API versus self-hosting for autonomous coding tasks.

0 favorites 0 likes

#large-language-model

Tested Xiaomi's MiMo V2.5 Pro for autonomous coding: 301 commits, 60+ pages, $70 in API costs. Now it's open-source.

Reddit r/ArtificialInteligence ↗ · 2026-05-13

Xiaomi has open-sourced its MiMo V2.5 Pro model, a 1.02T parameter MoE model designed for autonomous coding tasks. The article details a real-world test showing high efficiency with low API costs due to high cache hit rates.

1 favorites 1 likes

#large-language-model

AntAngelMed - 100a6b Healthcare LLM

Reddit r/LocalLLaMA ↗ · 2026-05-13 Cached

AntAngelMed is a newly open-sourced 100B-parameter medical language model developed by Zhejiang Health Information Center, Ant Healthcare, and Anzhen'er Medical AI. It achieves top rankings on HealthBench and MedAIBench, utilizing efficient MoE architecture for high-performance inference.

1 favorites 1 likes

#large-language-model

@TeksEdge: The world’s first open-source 100B medical LLM is here Local inferencers have a Health model option to run at home. Ant…

X AI KOLs Timeline ↗ · 2026-05-12

Zhejiang Health and Ant Healthcare released AntAngelMed, an open-source 100B parameter medical LLM that ranks top on MedBench and supports efficient local inference with high privacy.

0 favorites 0 likes

large-language-model

Submit Feedback