zeroth-order-optimization

Tag

Cards List
#zeroth-order-optimization

GRZO: Group-Relative Zeroth-Order Optimization for Large Language Model Fine-Tuning

arXiv cs.LG · 2026-06-03 Cached

GRZO is a novel zeroth-order optimization method for fine-tuning large language models that reduces variance by using group-relative normalization, achieving better accuracy and memory efficiency compared to MeZO.

0 favorites 0 likes
#zeroth-order-optimization

Aligned but Fragile: Enhancing LLM Safety Robustness via Zeroth-Order Optimization

arXiv cs.AI · 2026-05-29 Cached

This paper proposes a hybrid framework combining first-order safety alignment with zeroth-order refinement to enhance the robustness of LLM safety alignment against post-alignment perturbations. Theoretical and empirical results show that only a few refinement steps can improve robustness while preserving safety.

0 favorites 0 likes
← Back to home

Submit Feedback