Tag
Researchers from CUHK-Shenzhen introduce a jailbreak method using fanfiction subgenres from Archive of Our Own as attack carriers, embedding harmful content within creative writing scenes. Their method achieves a mean attack success rate of 0.731 on eight aligned LLMs, with a multi-turn extension (Saga-A4) reaching 0.924 ASR, outperforming existing methods.
POLARIS is a training recipe using GRPO with LLM-as-judge rewards and human-reference injection to improve long-form story generation in small models. Applied to Qwen3.5-9B, the resulting POLARIS-9B model matches Qwen3.5-27B performance on creative writing benchmarks while better adhering to length instructions.
An opinion piece argues that AI-generated fiction is like fast food, lacking the depth and originality of human-written stories, emphasizing the continued need for human authors.
Gemini 3.5 Flash outperforms Gemini 3.1 Pro on a short story creative writing benchmark, improving from -2.3 to -1.8 in head-to-head comparisons.
This paper introduces a dataset and training framework that transforms human-authored novels into multi-resolution planning scaffolds, enabling long-context language models to generate book-scale fiction with more human-like prose and narrative dynamics.
A writing-focused fine-tune of Google's Gemma 4 31B model, aiming for more natural English and better prose, with reduced refusals suitable for creative writing, translations, and roleplay.
OpenAI has published content about GPT-5's creative writing capabilities, highlighting the model's performance in generating creative text. This follows the release of GPT-5, OpenAI's latest and most advanced language model.