Guys, I need your honest feedback on the translation of the eFootball Lab

Warning: Before you read this post, please make sure you watch at least one episode of Amadeusz's eFootball Lab!

I saw more than one person saying the AI voiceover of the latest episode is not quite satisfying. In my original pilot episode, I generated two versions. Version 1 used the Microsoft open source voice generator API, and version 2 and the first episode used the current voice model (sampled from the original content creator Zhuhai Amadeusz, he only speaks Chinese). I was worried that generic Microsoft voice model would turn off a lot of people because such models are used to generate spams/ads flooded on all kinds of social media. Plus Amadeusz himself prefers to use his own voice, so we were leaning towards it.

However, I do realize my voice model performs poorly. I think one reason is that my model is def not good enough, but also there is a fundamental reason that the duration of sentences are different between Chinese and English. So you can hear sometimes it sounds like rushing, but sometimes it also has awakardly no sound.

Are there any solutions? I think manually adjusting the translated subtitles would work, but it can be time consuming. Currently I can do AI translation on subtitles and then let it generate the voiceover in a short amount of time. Hoever, I need to manually translate the in-video texts and charts, and it is already 5min video = 45 min editing. I do need help from someone else if that's the case. If anyone is voluntarying, I would be very happy. Feel free to dm me if u r interested.

What about leaving the orignial Chinese voiceover, since we already have YT CC subtitles? I actually don't know whether people will like that tbh. I was worried that sololy relying on subtitles is not enough for people to follow along/understand.

Using better/paid version of the voice model? I don't think this would be an effective solution, since I basically tried most of the free models. They sound terrible when they speak very fast (again, we have to solve the fundamental problem).

So far these are potential solutions I can come up with. I would like to do a polling to collect your honest feedback. At the same time, feel free to leave a comment if you think you have a better idea. Really appreciate!

View Poll