spanish


VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use

arXiv cs.CL · 2026-05-15

Presents VectraYX-Nano, a 42M-parameter decoder-only language model trained from scratch in Spanish for cybersecurity on a 170M-token corpus, with curriculum learning and native tool invocation via MCP (Model Context Protocol). The empirical findings include a loss-versus-register inversion and corpus-density artifacts in tool-use capability.
