multilingual

#multilingual

Cross-Lingual Exploration for Parametric Knowledge

arXiv cs.CL ↗ · 5h ago Cached

This paper explores cross-lingual prompting strategies to improve access to parametric knowledge in large language models, demonstrating significant gains in knowledge transfer and factual recall across 17 languages on multilingual benchmarks.

0 favorites 0 likes

#multilingual

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

arXiv cs.CL ↗ · 5h ago Cached

MMed-Bench-IR is a heterogeneous benchmark for multilingual medical information retrieval across six languages, evaluating cross-lingual alignment, concept discrimination, and evidence retrieval. It reveals severe performance drops for non-English queries, highlighting gaps in existing English-only evaluations.

0 favorites 0 likes

#multilingual

@noctus91: Mistral OCR 4 reading a handwritten Henri Poincaré letter from 1905. Historical manuscripts usually break OCR models. T…

X AI KOLs Following ↗ · 15h ago Cached

Mistral AI releases Mistral OCR 4, which can read historical handwritten manuscripts and provides bounding boxes, block classification, and inline confidence scores in 170 languages.

0 favorites 0 likes

#multilingual

Mistral OCR 4

Hacker News Top ↗ · 19h ago Cached

Mistral AI releases Mistral OCR 4, a compact document intelligence model that provides bounding boxes, block classification, and inline confidence scores for structured text extraction. It supports 170 languages, runs in a single container for self-hosted deployment, and integrates with the Mistral Search Toolkit for enterprise search and RAG pipelines.

0 favorites 0 likes

#multilingual

ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection

Hugging Face Daily Papers ↗ · yesterday Cached

ReMMD introduces a realistic multilingual multi-image agentic verification framework for multimodal misinformation detection, including a benchmark (ReMMDBench) with 500 samples and 2,756 images, and an agent (ReMMD-Agent) that achieves superior veracity performance with reduced costs.

0 favorites 0 likes

#multilingual

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

Hugging Face Blog ↗ · yesterday Cached

PP-OCRv6 is the latest generation of PaddleOCR's universal OCR model family, offering three tiers from 1.5M to 34.5M parameters, supporting 50 languages, and achieving significant accuracy improvements over previous versions.

0 favorites 0 likes

#multilingual

Apertus – Open Foundation Model for Sovereign AI

Hacker News Top ↗ · 2d ago Cached

Apertus is a fully open foundation model for sovereign AI, developed by the Swiss AI Initiative. It is open weights, open data, open science, compliant with EU AI Act, and competitive with top open models at 8B and 70B parameters, supporting over 1000 languages.

0 favorites 0 likes

#multilingual

@OpenAI: To improve our models, we collaborate with a global network of hundreds of physicians across 60 countries, 49 languages…

X AI KOLs ↗ · 5d ago Cached

OpenAI announces GPT-5.5 Instant, now on par with frontier thinking models for health-related questions, available to all free users, with improvements in recognizing urgent care and explaining uncertainty.

0 favorites 0 likes

#multilingual

@liquidai: Introducing LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M: two multilingual retrieval models built for ultra-fast and a…

X AI KOLs Following ↗ · 5d ago Cached

Liquid AI introduces LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M, two multilingual retrieval models optimized for fast and accurate search across 11 languages, with latency as low as 1.5ms.

0 favorites 0 likes

#multilingual

@lmsysorg: SGLang-Omni now serves MOSS-TTS-Local Transformer v1.5 from @Open_MOSS on day 0! This is an open 48 kHz stereo TTS mode…

X AI KOLs Timeline ↗ · 6d ago Cached

MOSS-TTS-Local Transformer v1.5 is an open-source 48 kHz stereo TTS model with zero-shot voice cloning, native streaming, and support for 31 languages, built on a Qwen3-4B backbone and served via SGLang-Omni.

0 favorites 0 likes

#multilingual

@MosiAI_Official: MOSS-TTS Local Transformer v1.5 is here. Clone any voice. Speak any language. Hear every detail. 30+ languages, 48 kHz …

X AI KOLs Following ↗ · 6d ago Cached

MosiAI has released MOSS-TTS Local Transformer v1.5, a text-to-speech model that supports voice cloning, over 30 languages, and high-quality 48 kHz output.

0 favorites 0 likes

#multilingual

LLM Parameters for Math Across Languages: Shared or Separate?

arXiv cs.CL ↗ · 6d ago Cached

This paper presents a cross-lingual mechanistic analysis of mathematical reasoning in LLMs, finding partial overlap of math-associated parameters across languages, concentrated in intermediate layers. English has the largest set of math-relevant parameters, while lower-resource languages have smaller sets.

0 favorites 0 likes

#multilingual

@FakeMaidenMaker: Explosive! This open-source project converts text to human-like voice for free, can clone anyone's voice, and adjust timbre with text! GitHub has garnered 30K stars, from Mianbao Intelligent OpenBMB, VoxCPM previously topped both GitHub and HuggingFace charts. Do...

X AI KOLs Timeline ↗ · 2026-06-17 Cached

VoxCPM2 is an open-source speech synthesis model from OpenBMB, using a tokenizer-free diffusion autoregressive architecture, supporting 30 languages, voice design, and controllable voice cloning. It can clone a voice with just one sentence, or create a brand new voice using text, outputting 48kHz high-quality audio, and is commercially usable.

0 favorites 0 likes

#multilingual

When English Isn't the Best Teacher: Source Language Effects in Cross-Lingual In-Context Learning

arXiv cs.CL ↗ · 2026-06-17 Cached

This paper empirically studies cross-lingual transfer in in-context learning across seven tasks, six models, and typologically diverse languages, showing that fine-tuning based expectations do not consistently apply and offering new heuristics for source language selection.

0 favorites 0 likes

#multilingual

Are you speaking my languages? On spoken language adherence in multimodal LLMs

arXiv cs.CL ↗ · 2026-06-17 Cached

This paper addresses the problem of spoken language adherence in multimodal LLMs for ASR, proposing a soft prompting approach and novel metric to quantify language violations. It evaluates three mitigation strategies—zero-shot prompting, supervised fine-tuning, and chain-of-thought reasoning—across multiple languages to improve transcription fidelity.

0 favorites 0 likes

#multilingual