text-generation

#text-generation

@Michaelzsguo: This is the best read on DeepSeek’s recent innovation, DSpark: Think of DSpark as: The main model rapidly brainstorms t…

X AI KOLs Timeline ↗ · 4h ago Cached

DeepSeek released DSpark, a system where the main model rapidly generates a sentence while a tiny editor fixes coherence before verification, pushing LLM systems engineering beyond new architecture.

0 favorites 0 likes

#text-generation

Automatic Generation of Highlights for Academic Paper Via Prompt-based Learning

arXiv cs.CL ↗ · 3d ago Cached

This paper investigates prompt-based learning for automatically generating highlights of academic papers, using models like GPT-2, T5, and ChatGPT, and shows that ChatGPT with few-shot prompts achieves performance comparable to or better than supervised methods without requiring task-specific training data.

0 favorites 0 likes

#text-generation

I'm eager for a 15x speedup on my strix halo

Reddit r/LocalLLaMA ↗ · 5d ago

Nvidia claims a 15x speedup in text generation using a diffusion model, generating entire blocks at once.

0 favorites 0 likes

#text-generation

GLM-5.2 Is The Best Open Weight Creative Writing Model

Reddit r/LocalLLaMA ↗ · 2026-06-18

GLM-5.2 is an open weight AI model optimized for creative writing tasks, claimed to be the best in its category.

0 favorites 0 likes

#text-generation

VoidPadding: Let [VOID] Handle Padding in Masked Diffusion Language Models so that [EOS] Can Focus on Semantic Termination

arXiv cs.CL ↗ · 2026-06-17 Cached

VoidPadding introduces a [VOID] token to handle padding in masked diffusion language models, allowing [EOS] to focus solely on semantic termination. This method significantly improves performance on reasoning and coding benchmarks while reducing decoding steps.

0 favorites 0 likes

#text-generation

Can gzip be a language model?

Lobsters Hottest ↗ · 2026-06-16 Cached

This article explores using the gzip compression algorithm as a language model, demonstrating that compression algorithms can generate text by scoring candidate continuations based on compressed length, using beam search to produce output.

0 favorites 0 likes

#text-generation

@nathanrs: I found out the other day that any compression tool can be contorted to do language modeling. Turns out gzip can genera…

X AI KOLs Following ↗ · 2026-06-16 Cached

The tweet describes how any compression tool, including gzip, can be adapted for language modeling, and that gzip can generate text that resembles Shakespeare. A write-up is linked.

0 favorites 0 likes

#text-generation

@volokuleshov: Congratulations to Google on open-sourcing Gemma Diffusion! I want to give a shout-out to a group of really talented Co…

X AI KOLs Timeline ↗ · 2026-06-11 Cached

Google has open-sourced DiffusionGemma, a novel diffusion-based text generation model that uses block diffusion and efficient encoder-decoder techniques, with contributions from Cornell University researchers.

0 favorites 0 likes

#text-generation

DiffusionGemma

Simon Willison's Blog ↗ · 2026-06-10 Cached

Google released DiffusionGemma, an open-weight text generation model (26B parameters, 4B active) under Apache 2 license, demonstrating high inference speeds via NVIDIA's NIM cloud API.

0 favorites 0 likes

#text-generation

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

Ars Technica ↗ · 2026-06-10 Cached

Google released DiffusionGemma, an experimental open-source diffusion model for text generation that achieves 4x speed boost over autoregressive models, optimized for local processing.

0 favorites 0 likes

#text-generation

@_philschmid: Gemma goes diffusion! DiffusionGemma with up to 1000+ tokens per second! - Built on Gemma 4 as a 26B MoE model. - 3.8B …

X AI KOLs Following ↗ · 2026-06-10 Cached

DiffusionGemma, a 26B MoE model based on Gemma 4, achieves over 1000 tokens per second using diffusion for text generation in 256-token blocks, fitting in 18GB VRAM with quantization, released under Apache 2.0.

0 favorites 0 likes

#text-generation

DiffusionGemma: The Developer Guide- Google Developers Blog

Reddit r/LocalLLaMA ↗ · 2026-06-10 Cached

DiffusionGemma is a new experimental model from Google DeepMind that uses parallel generation on a 256-token canvas, achieving up to 4x faster token generation on GPUs. This developer guide explains its architecture, bidirectional context, and includes a fine-tuning recipe for solving Sudoku.

0 favorites 0 likes

#text-generation

DiffusionGemma: 4x Faster Text Generation

Hacker News Top ↗ · 2026-06-10 Cached

Google introduces DiffusionGemma, an experimental 26B MoE open model that achieves up to 4x faster text generation on GPUs using text diffusion, targeting speed-critical interactive local workflows.

0 favorites 0 likes

#text-generation

google/diffusiongemma-26B-A4B-it

Hugging Face Models Trending ↗ · 2026-06-09 Cached

Google DeepMind releases DiffusionGemma, a 26B-parameter Mixture-of-Experts model that uses discrete diffusion for faster text generation, supporting multimodal inputs and a 256K token context.

0 favorites 0 likes

#text-generation

Supportive Token Revealing for Fast Diffusion Language Model Decoding

arXiv cs.CL ↗ · 2026-06-04 Cached

This paper proposes AXON, a training-free module that improves the quality-latency trade-off of discrete diffusion language model decoding by intelligently selecting 'anchor' tokens to reveal first, using attention, uncertainty, and confidence signals to support subsequent denoising steps. Experiments on reasoning and code-generation benchmarks show AXON reduces function evaluations while maintaining or improving accuracy.

0 favorites 0 likes

#text-generation

Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning

arXiv cs.CL ↗ · 2026-06-03 Cached

Introduces LEDE, a framework using offline reinforcement learning to dynamically select exit layers and speculation lengths for self-speculative decoding in LLMs, achieving up to 2.7x speedup over autoregressive decoding.

0 favorites 0 likes

#text-generation

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Hugging Face Blog ↗ · 2026-05-23 Cached

NVIDIA introduces Nemotron-Labs Diffusion, a family of diffusion language models that generate text in parallel and iteratively refine it, offering faster generation and the ability to revise previous tokens.

0 favorites 0 likes

#text-generation

Drifting Objectives for Refining Discrete Diffusion Language Models

arXiv cs.CL ↗ · 2026-05-20 Cached

This paper introduces TokenDrift, a drifting objective that refines discrete diffusion language models by lifting categorical predictions to a continuous semantic space for anti-symmetric drifting, significantly improving generation quality under a fixed number of denoising steps.

0 favorites 0 likes

#text-generation

MiniGPT: Rebuilding GPT from First Principles

arXiv cs.CL ↗ · 2026-05-19 Cached

This paper presents MiniGPT, a compact from-scratch implementation of GPT-style autoregressive language modeling in PyTorch, built after studying nanoGPT. It evaluates the model on the Tiny Shakespeare dataset using character-level tokenization, achieving a validation loss of 1.4780 with a 10.77M-parameter configuration.

0 favorites 0 likes

#text-generation

Dynamic Chunking for Diffusion Language Models

arXiv cs.CL ↗ · 2026-05-18 Cached

This paper introduces Dynamic Chunking for Diffusion Language Models (DCDM), which replaces fixed positional blocks in block discrete diffusion with content-defined semantic chunks using a differentiable Chunking Attention mechanism, achieving consistent improvements across scales up to 1.5B parameters.

0 favorites 0 likes

text-generation

Submit Feedback