G4-Meromero-31B-Uncensored-Heretic 现已发布,它是 Gemma 4 31B 的微调版本,专为创意任务设计,KLD为0.0100,拒绝率为15/100!
摘要
G4-Meromero-31B-Uncensored-Heretic 是 Gemma 4 31B 的微调版本,将拒绝率降低至15/100,同时保持KL散度为0.01,保留了模型质量。它专为创意任务设计,可在Hugging Face上以GGUF量化格式获取。
查看缓存全文
缓存时间: 2026/05/17 01:19
llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF · Hugging Face
来源:https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF
🚨⚠️ 我已达到 Hugging Face 的免费存储上限 ⚠️🚨
除非我能承担额外存储的费用,否则无法再上传新模型。我作为独立贡献者托管了70多个免费模型,且这项工作无报酬。没有您的支持,将无法上传任何新模型。
🎉 Patreon(月度赞助)(https://patreon.com/LLMfan46) | ☕ Ko-fi(一次性赞助)(https://ko-fi.com/llmfan46)
每一笔贡献都将直接用于 Hugging Face 存储费用,以确保模型对所有人免费。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#85-fewer-refusals-15100-uncensored-vs-99100-original-while-preserving-model-quality-00100-kl-divergence拒绝率降低 85%(去审查版 15/100,原版 99/100),同时保持模型质量(KL 散度 0.0100)。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#%E2%9D%A4%EF%B8%8F-support-my-work❤️ 支持我的工作
创建这些模型需要大量的时间、工作和算力。如果您觉得它们有用,请考虑支持我:
image/png (https://huggingface.co/llmfan46/Omega-Darker-Gaslight_The-Final-Forgotten-Fever-Dream-24B-ultra-uncensored-heretic-v1/resolve/main/waifu001.webp)
您的帮助将激励我,并用于进一步改进工作流程、支付存储和算力费用,甚至可能通过租用云端 GPU 来帮助去审查更大的模型。
GGUF 量化版本来自 llmfan46/G4-MeroMero-31B-uncensored-heretic (https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic)。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#this-is-a-decensored-version-of-zerofatag4-meromero-31b-made-using-heretic-v120-with-the-arbitrary-rank-ablation-ara-method这是使用 Heretic v1.2.0 和 Arbitrary-Rank Ablation (ARA) 方法对 zerofata/G4-MeroMero-31B (https://huggingface.co/zerofata/G4-MeroMero-31B) 进行的去审查版本。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#abliteration-parametersAbliteration 参数
| 参数 | 值 |
|---|---|
| start_layer_index | 28 |
| end_layer_index | 49 |
| preserve_good_behavior_weight | 0.5600 |
| steer_bad_behavior_weight | 0.0001 |
| overcorrect_relative_weight | 0.9726 |
| neighbor_count | 10 |
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#targeted-components目标组件
- attn.o_proj
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#performance性能
| 指标 | 本模型 | 原始模型 (G4-MeroMero-31B) (https://huggingface.co/zerofata/G4-MeroMero-31B) |
|---|---|---|
| KL 散度 | 0.01000 | (依定义) |
| 拒绝率 | ✅ 15/100 | ❌ 99/100 |
较低的拒绝率表示内容限制更少,而较低的 KL 散度表示更接近原始模型基线。较高的拒绝率会导致更多的拒绝、反对、反驳、说教、审查、弱化和回避。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#mmlu-test-resultsMMLU 测试结果:
原始模型:
============================================================
- 总问题数:7021
- 正确数:6110
- 准确率:0.8702 (87.02%)
- 解析失败:24
============================================================
各科目得分:
- professional_law: 0.7694 (604/785)
- moral_scenarios: 0.8281 (366/442)
- miscellaneous: 0.9295 (356/383)
- professional_psychology: 0.9019 (285/316)
- high_school_psychology: 0.9704 (262/270)
- high_school_macroeconomics: 0.9289 (183/197)
- elementary_mathematics: 0.9457 (174/184)
- moral_disputes: 0.8621 (150/174)
- prehistory: 0.9302 (160/172)
- philosophy: 0.8616 (137/159)
- high_school_biology: 0.9539 (145/152)
- professional_accounting: 0.8322 (119/143)
- clinical_knowledge: 0.9286 (130/140)
- high_school_microeconomics: 0.9706 (132/136)
- nutrition: 0.9333 (126/135)
- professional_medicine: 0.9328 (125/134)
- conceptual_physics: 0.9141 (117/128)
- high_school_mathematics: 0.6614 (84/127)
- human_aging: 0.8362 (97/116)
- security_studies: 0.8839 (99/112)
- high_school_statistics: 0.8919 (99/111)
- marketing: 0.9633 (105/109)
- high_school_world_history: 0.9434 (100/106)
- sociology: 0.8932 (92/103)
- high_school_government_and_politics: 0.9703 (98/101)
- high_school_geography: 0.9293 (92/99)
- high_school_chemistry: 0.7732 (75/97)
- high_school_us_history: 0.9474 (90/95)
- virology: 0.5056 (45/89)
- college_medicine: 0.8636 (76/88)
- world_religions: 0.8977 (79/88)
- high_school_physics: 0.8095 (68/84)
- electrical_engineering: 0.8642 (70/81)
- astronomy: 0.9494 (75/79)
- logical_fallacies: 0.8816 (67/76)
- high_school_european_history: 0.9041 (66/73)
- anatomy: 0.8873 (63/71)
- college_biology: 0.9844 (63/64)
- human_sexuality: 0.9375 (60/64)
- formal_logic: 0.7812 (50/64)
- public_relations: 0.7541 (46/61)
- international_law: 0.9167 (55/60)
- college_physics: 0.7018 (40/57)
- college_mathematics: 0.8000 (44/55)
- econometrics: 0.7963 (43/54)
- jurisprudence: 0.8679 (46/53)
- high_school_computer_science: 0.9808 (51/52)
- machine_learning: 0.8654 (45/52)
- medical_genetics: 0.9608 (49/51)
- global_facts: 0.5882 (30/51)
- management: 0.9200 (46/50)
- us_foreign_policy: 0.9400 (47/50)
- college_chemistry: 0.6596 (31/47)
- abstract_algebra: 0.7872 (37/47)
- business_ethics: 0.8261 (38/46)
- college_computer_science: 0.9333 (42/45)
- computer_security: 0.8372 (36/43)
Heretic 去审查版:
============================================================
- 总问题数:7021
- 正确数:6096
- 准确率:0.8683 (86.83%)
- 解析失败:24
============================================================
各科目得分:
- professional_law: 0.7631 (599/785)
- moral_scenarios: 0.8235 (364/442)
- miscellaneous: 0.9269 (355/383)
- professional_psychology: 0.8956 (283/316)
- high_school_psychology: 0.9704 (262/270)
- high_school_macroeconomics: 0.9188 (181/197)
- elementary_mathematics: 0.9511 (175/184)
- moral_disputes: 0.8621 (150/174)
- prehistory: 0.9302 (160/172)
- philosophy: 0.8553 (136/159)
- high_school_biology: 0.9539 (145/152)
- professional_accounting: 0.8252 (118/143)
- clinical_knowledge: 0.9286 (130/140)
- high_school_microeconomics: 0.9559 (130/136)
- nutrition: 0.9185 (124/135)
- professional_medicine: 0.9403 (126/134)
- conceptual_physics: 0.9062 (116/128)
- high_school_mathematics: 0.6535 (83/127)
- human_aging: 0.8448 (98/116)
- security_studies: 0.8750 (98/112)
- high_school_statistics: 0.9009 (100/111)
- marketing: 0.9633 (105/109)
- high_school_world_history: 0.9528 (101/106)
- sociology: 0.9029 (93/103)
- high_school_government_and_politics: 0.9802 (99/101)
- high_school_geography: 0.9293 (92/99)
- high_school_chemistry: 0.7629 (74/97)
- high_school_us_history: 0.9368 (89/95)
- virology: 0.5056 (45/89)
- college_medicine: 0.8636 (76/88)
- world_religions: 0.9205 (81/88)
- high_school_physics: 0.7976 (67/84)
- electrical_engineering: 0.8765 (71/81)
- astronomy: 0.9494 (75/79)
- logical_fallacies: 0.8947 (68/76)
- high_school_european_history: 0.9178 (67/73)
- anatomy: 0.8873 (63/71)
- college_biology: 0.9688 (62/64)
- human_sexuality: 0.9375 (60/64)
- formal_logic: 0.7812 (50/64)
- public_relations: 0.7541 (46/61)
- international_law: 0.9167 (55/60)
- college_physics: 0.7193 (41/57)
- college_mathematics: 0.8000 (44/55)
- econometrics: 0.7963 (43/54)
- jurisprudence: 0.8679 (46/53)
- high_school_computer_science: 0.9808 (51/52)
- machine_learning: 0.8269 (43/52)
- medical_genetics: 0.9608 (49/51)
- global_facts: 0.5882 (30/51)
- management: 0.9200 (46/50)
- us_foreign_policy: 0.9600 (48/50)
- college_chemistry: 0.6170 (29/47)
- abstract_algebra: 0.8085 (38/47)
- business_ethics: 0.8478 (39/46)
- college_computer_science: 0.9111 (41/45)
- computer_security: 0.8372 (36/43)
MMLU – 大规模多任务语言理解,涵盖 57 个科目(数学、历史、法律、医学等)的多项选择题。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#quantizations量化版本
对于下方的 K-quant,必要时会将较小的 SSM 张量保留为更高精度。
Q6_K将ssm_alpha、ssm_beta和ssm_out保留为Q8_0。Q5_K、Q4_K和Q3_K量化将ssm_alpha和ssm_beta保留为Q8_0,而ssm_out保留为Q6_K。
这有助于在文件大小略微增加的情况下保留混合/SSM 块。
| 文件名 | 量化 | 描述 |
|---|---|---|
| G4-MeroMero-31B-uncensored-heretic-BF16.gguf | BF16 | 全精度 |
| G4-MeroMero-31B-uncensored-heretic-Q8_0.gguf | Q8_0 | 接近无损,推荐 |
| G4-MeroMero-31B-uncensored-heretic-Q6_K.gguf | Q6_K | 极好质量 |
| G4-MeroMero-31B-uncensored-heretic-Q5_K_M.gguf | Q5_K_M | 良好平衡 |
| G4-MeroMero-31B-uncensored-heretic-Q5_K_S.gguf | Q5_K_S | 较小的 Q5 |
| G4-MeroMero-31B-uncensored-heretic-Q4_K_M.gguf | Q4_K_M | 适合有限 VRAM |
| G4-MeroMero-31B-uncensored-heretic-Q4_K_S.gguf | Q4_K_S | 较小的 Q4 |
| G4-MeroMero-31B-uncensored-heretic-Q3_K_L.gguf | Q3_K_L | 低 VRAM,质量尚可 |
| G4-MeroMero-31B-uncensored-heretic-Q3_K_M.gguf | Q3_K_M | 低 VRAM,更小 |
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#vision-projector视觉投影器
| 文件名 | 量化 | 描述 |
|---|---|---|
| G4-MeroMero-31B-uncensored-heretic-mmproj-BF16.gguf | BF16 | 原生精度 |
实现视觉/多模态功能需要视觉投影器文件。请配合上述任意量化版本使用。
https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF#usage使用说明
适用于 llama.cpp、LM Studio、Ollama 以及其他兼容 GGUF 的工具。
来自 zerofata 的模型概述:
概览
Gemma 4 31B 的微调版本,专为创意任务设计。
谷歌推出的另一个难以驾驭但极其优秀的模型。
本模型在滑动多样性方面略有提升,写作风格不那么华丽/冗长。推理过程通常比原始模型稍长,但智力水平与原始模型相当。
支持思考与非思考模式。
SillyTavern 设置
建议的角色扮演格式:
- 动作:纯文本
- 对话:引号内
- 思考:星号内
推荐采样参数:
- 温度:0.8 - 1.0
- MinP:0.05
创建过程
创建过程:SFT > 合并
在约 4900 万 token 上进行 SFT 训练。
尽管使用了 4900 万 token,该数据集规模相对适中。可训练部分约为 1000-1500 万 token。所有数据集仅针对最后一轮进行训练,以忠实反映 Gemma 4 聊天模板。
该方法与 26B A4B MeroMero 非常相似。我在数据上激进地训练了 2 个 epoch,测试了多个检查点后,最终选择了 1 epoch 的检查点,该检查点具有所需风格且过拟合迹象最少。
我将此检查点合并回原始 instruct 模型,从而清理了残留的过拟合,同时保留了微调的变更。
使用 Axolotl 训练。
Mergekit 配置:
models:
- model: google/gemma-4-31B-it
- model: ApocalypseParty/G4-31B-SFT-v3-1-1ep
merge_method: slerp
parameters:
t: 0.5
base_model: google/gemma-4-31B-it
dtype: bfloat16
Axolotl 配置:
base_model: google/gemma-4-31B-it
plugins:
- axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
- axolotl.integrations.liger.LigerPlugin
liger_layer_norm: true
liger_rope: true
liger_rms_norm: true
liger_glu_activation: true
liger_rms_norm_gated: true
strict: false
cut_cross_entropy: true
datasets:
- path: zerofata/pretok
val_set_size: 0.02
output_dir: ./G4-31B-SFT-v3-1
sequence_len: 10756
pad_to_sequence_len: true
sample_packing: true
load_in_4bit: false
adapter: lora
lora_r: 64
lora_alpha: 64
peft_use_rslora: true
lora_dropout: 0.0
freeze_mm_modules: true
lora_target_modules: 'model.language_model.layers.[\d]+.(_checkpoint_wrapped_module.)?(mlp|self_attn).(up|down|gate|q|k|v|o)_proj'
wandb_project: G4-31B-SFT
wandb_name: G4-31B-SFT-v3-1
gradient_accumulation_steps: 1
micro_batch_size: 4
num_epochs: 2
optimizer: adamw_torch_fused
lr_scheduler: constant_with_warmup
learning_rate: 1e-5
max_grad_norm: 1.0
bf16: auto
tf32: true
logging_steps: 1
# FA2 not supported
sdp_attention: true
#flex_attention: true
#torch_compile: true
flash_attention: false
warmup_ratio: 0.1
evals_per_epoch: 4
saves_per_epoch: 2
weight_decay: 0.05
special_tokens:
fsdp_config:
fsdp_version: 2
offload_params: false
cpu_ram_efficient_loading: false
auto_wrap_policy: TRANSFORMER_BASED_WRAP
transformer_layer_cls_to_wrap: Gemma4TextDecoderLayer
state_dict_type: FULL_STATE_DICT
sharding_strategy: FULL_SHARD
reshard_after_forward: true
activation_checkpointing: true
相似文章
Gemma 4 26B-A4B GGUF 基准测试
嘿,r/LocalLLaMA 社区,我们为不同提供方的 Gemma 4 26B-A4B GGUF 进行了 KL 散度(KL Divergence)基准测试,以帮助大家挑选最佳的量化版本。* 平均 KL 散度结果使几乎所有 **Unsloth GGUF 都位于帕累托前沿** * KLD 用于衡量量化模型与原始 BF16 输出分布的匹配程度,从而反映模型保留的精度。* 这使得 Unsloth 在 21/22 种尺寸中**表现最佳。**99.9% KLD 及其他指标也呈现相似趋势。* 我们还更新了我们的 Q6_K 量化版本以提高动态性。此前,它们...
gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic 现已发布,一个写作微调模型,旨在提升Gemma 4 31B it的写作质量,提供更自然的英语和更优美的文笔,适用于创意写作、翻译和角色扮演!
一个以写作为导向的微调,基于Google的Gemma 4 31B模型,旨在实现更自然的英语和更优美的文笔,减少拒绝回答,适合创意写作、翻译和角色扮演。
首批Gemma 4 12B微调模型已就绪
首批Gemma 4 12B微调模型变体现在已在Hugging Face上线,由多位开发者提供GGUF格式。
HauhauCS/Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced
HauhauCS 发布了 Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced,这是 Gemma4 的无损无审查变体,经过一个多月的开发,拒绝次数为 0/465,提供 GGUF 格式。
DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF
基于 Qwen 3.6 27B 模型的社区微调去审查版本,提供高精度 GGUF 量化。