Tag
A study evaluating six commercial AI chatbots on factual questions derived from BBC News across six languages. It finds high multiple-choice accuracy but significantly lower free-response accuracy; retrieval errors drive over 70% of failures, and the results reveal regional biases.