Comment What's your beef with Betteridge? (Score 1) 100
There was no need to phrase this as a question: if you have ever used YouTube subtitles, you'll know they are bad.
The real question is why. While I think AI is overhyped in general, I don't believe that this is the best Google could do if they put in some effort. It seems the model is just picking the most likely word on an individual word level, not considering any context like the video title or description (which can contain the very names it gets wrong), the type of content the channel typically publishes or even the previously transcribed text. As evidence of the latter, a name might get transcribed incorrectly in different ways during the same video.
While names are probably the #1 thing it gets wrong, it also fails on English words quite often in ways that seem very avoidable. Machine learning is all about detecting likely combinations and their transcription model should be able to guess much better if it was provided with more context than just the audio. I wouldn't be surprised that if you fed a YouTube transcript to an LLM that doesn't have access to the audio and asked it to correct unlikely words, it would end up better on average.