I saw that the Whisper large-v2 model is multilingual, which gave me the idea to try using it to translate the commentary on Korean webcasts. I made a clip from an old SC:BW VOD to test it out, and here's what I came up with, using Whisper and then ChatGPT to clean up the results: https://streamable.com/jp0pue
I don't speak Korean, so I can't evaluate the quality or accuracy of the translation. If anyone here speaks Korean well, I'd love to know what you think of it.
Accuracy aside, the process was quite time-consuming. As mentioned in the README, Whisper is not very fast yet, especially the large-* models. But if someone out there is willing to put in more time, maybe it's now possible to caption more of these videos without actually speaking the language?
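For anyone who wants to try something similar, here is a rough sketch of what the Whisper half of such a pipeline could look like. It is not necessarily the exact process used for the clip above: it assumes Whisper's built-in `task="translate"` mode (Korean audio in, English text out) and writes the segments as SRT captions. The helper names (`format_timestamp`, `segments_to_srt`, `translate_clip`) and the file name `clip.mp4` are made up for illustration, and the ChatGPT clean-up step is left out.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Turn Whisper-style segments (dicts with start, end, text) into SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # SRT entries are separated by a blank line.
    return "\n".join(blocks)


def translate_clip(path: str, model_name: str = "large-v2") -> str:
    """Translate Korean speech in a media file to English SRT captions.

    Assumption: the openai-whisper package and ffmpeg are installed.
    The import is done lazily because loading Whisper is slow and heavy.
    """
    import whisper

    model = whisper.load_model(model_name)
    # task="translate" asks Whisper for English output instead of a
    # Korean transcript; language="ko" skips automatic language detection.
    result = model.transcribe(path, language="ko", task="translate")
    return segments_to_srt(result["segments"])
```

Calling `translate_clip("clip.mp4")` would return the caption text, which could then be saved to a `.srt` file or pasted into ChatGPT for clean-up. Passing a smaller model name such as `"medium"` should run faster, probably at some cost in translation quality.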