Tag

#fine-tuning

2 posts tagged fine-tuning.

bypass

G4-MeroMero-31B: Abliteration Drops Refusal Rate 99% to 15%

A new uncensored fine-tune of Gemma 4 31B achieves a 15/100 refusal rate via Arbitrary-Rank Ablation on attention output projections — KL divergence 0.
May 15, 2026
alignment

LLM Alignment: What It Does, Where It Breaks, How to Deploy

LLM alignment trains models to internalize safety constraints — but every technique has documented bypass paths.
May 10, 2026