YouTube recently added the Interactive Transcript feature, which uses speech recognition technology to automatically generate the transcript.
This is an easy clue that Google would use this data to list videos for the keywords spoken in the video.
Though the transcription isn’t always accurate, it surely helps with SEO if done correctly.