@SpaceTimeViking: Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED! BF16 and Quantized in NVFP4 for the DGX Spark / Blackwell arch. Preserv…
Summary
Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED, a model with BF16 and NVFP4 quantization for DGX Spark/Blackwell architecture, claiming 200-300% performance improvement with working DFlash.
View Cached Full Text
Cached at: 06/28/26, 01:57 AM
Announcing Orinth 1.0 AEON ULTIMATE UNCENSORED!
BF16 and Quantized in NVFP4 for the DGX Spark / Blackwell arch.
Preserved Attention Layers at BF16 for lossless quality!
QuickStart with WORKING DFLASH! 200%-300% Performance over stock!
I also managed to get DFlash working. https://t.co/8fIyBohWB1
Similar Articles
@SpaceTimeViking: Qwen3.6 27B getting some love on the new AEON ULTIMATE VLLM image @NVIDIAAI DGX SPARK OPTIMIZED! https://github.com/AEO…
AEON-7 releases a fully uncensored, capability-enhanced abliteration of Qwen3.6-27B, optimized for NVIDIA DGX Spark with NVFP4 quantization and DFlash speculative decoding for improved performance.
DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF
A community-finetuned, uncensored version of the Qwen 3.6 27B model featuring high-precision GGUF quantizations.
@TheAhmadOsman: Luke Alonso has uploaded an NVFP4 of GLM 5.2 467GB, would fit on 4x DGX Sparks (~$20k)
Luke Alonso uploaded an NVFP4 quantized version of GLM 5.2 (467GB) that can fit on 4x DGX Sparks hardware, costing approximately $20k.
@sudoingX: i was running Ornith new 35b moe on llama.cpp with a Q4 quant, 4 bit, small, fast. it hit ~78 tok/s. then i swapped eng…
A 35B MoE agentic coding model called Ornith runs near lossless at FP8 on a single DGX Spark, achieving 3M token context and ~36 tok/s, with speculative decoding expected to boost speed further.
DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF
DavidAU releases a custom 40B parameter model based on Qwen 3.6, expanded and fine-tuned with Claude 4.6 Opus distill and Deckard datasets, featuring optimized GGUF quantizations for improved precision and uncensored capabilities.