@SpaceTimeViking: Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED! BF16 and Quantized in NVFP4 for the DGX Spark / Blackwell arch. Preserv…

X AI KOLs Timeline 06/27/26, 09:49 PM Models

orinth uncensored bf16 quantization dgx-spark blackwell performance

Summary

Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED, a model with BF16 and NVFP4 quantization for DGX Spark/Blackwell architecture, claiming 200-300% performance improvement with working DFlash.

Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED! BF16 and Quantized in NVFP4 for the DGX Spark / Blackwell arch. Preserved Attention Layers at BF16 for lossless quality! QuickStart with WORKING DFLASH! 200%-300% Performance over stock! I also managed to get DFlash working. https://t.co/8fIyBohWB1

Original Article

View Cached Full Text

Cached at: 06/28/26, 01:57 AM

Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED!

BF16 and Quantized in NVFP4 for the DGX Spark / Blackwell arch.

Preserved Attention Layers at BF16 for lossless quality!

QuickStart with WORKING DFLASH! 200%-300% Performance over stock!

I also managed to get DFlash working. https://t.co/8fIyBohWB1

Similar Articles

@SpaceTimeViking: Qwen3.6 27B getting some love on the new AEON ULTIMATE VLLM image @NVIDIAAI DGX SPARK OPTIMIZED! https://github.com/AEO…

X AI KOLs Timeline

AEON-7 releases a fully uncensored, capability-enhanced abliteration of Qwen3.6-27B, optimized for NVIDIA DGX Spark with NVFP4 quantization and DFlash speculative decoding for improved performance.

DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF

Hugging Face Models Trending

A community-finetuned, uncensored version of the Qwen 3.6 27B model featuring high-precision GGUF quantizations.

@TheAhmadOsman: Luke Alonso has uploaded an NVFP4 of GLM 5.2 467GB, would fit on 4x DGX Sparks (~$20k)

X AI KOLs Following

Luke Alonso uploaded an NVFP4 quantized version of GLM 5.2 (467GB) that can fit on 4x DGX Sparks hardware, costing approximately $20k.

@sudoingX: i was running Ornith new 35b moe on llama.cpp with a Q4 quant, 4 bit, small, fast. it hit ~78 tok/s. then i swapped eng…

X AI KOLs Timeline

A 35B MoE agentic coding model called Ornith runs near lossless at FP8 on a single DGX Spark, achieving 3M token context and ~36 tok/s, with speculative decoding expected to boost speed further.

DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF