DeepSeek R1 struggles with this
(the model is obviously groundbreaking, just wanted to share my experience)
I toyed around with it all day yesterday, including using the API. It's been great at everything I throw at it except..
language knowledge.
Hungarian is my mother tongue, but I write in English more fluently, so I type out emails and internal memos in English, then have AI translate it for me to Hungarian. I checked how R1 performs on the Multilingual Index, but it's not listed.
Based on my testing, it's close to useless for this task, falls significantly behind GPT4o. The translated text has at least one major grammar/word error in each sentence. Often sentences make 0 sense whatsoever.
4o is slightly better, but it struggles writing sentences that don't sound like they were translated literally from English to Hungarian. So tons of unnatural/foreign wording, and sentence structure.
After these two models I looked at Gemin,i 2.0 Flash Thinking Exp, and it is absolutely stellar.
I know my use case is incredibly niche, as it's a language spoken by only 14M people so the training data is incredibly limited, but just wanted to share my experience and ask y'all how it's performing in the multilingual realm for you guys.