How can I compare a human-made audio transcript with a speech-to-text transcript and automatically add missing words to the original transcript?
I proofread legal transcripts that were taken down live from audio. My job is to go over each transcript after the fact while listening to the recording, fixing any errors, adding missing words, and removing incorrect ones.
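To make the goal concrete, here is a rough sketch of the kind of comparison I have in mind, using Python's standard `difflib.SequenceMatcher` to align the two transcripts word by word. The function name `merge_missing_words` and the `[+word]` marker are placeholders I made up for illustration, and the sketch assumes both transcripts are plain text with no timestamps or speaker labels. It only inserts words that appear in the speech-to-text output but not in the manual transcript; where the two disagree, it keeps the manual wording.

```python
from difflib import SequenceMatcher


def merge_missing_words(manual: str, asr: str) -> str:
    """Insert words found only in the speech-to-text transcript into the manual one.

    Hypothetical helper: inserted words are wrapped as [+word] so a proofreader
    can review them. Replacements and deletions keep the manual wording.
    """
    manual_words = manual.split()
    asr_words = asr.split()

    def norm(word: str) -> str:
        # Compare case-insensitively and ignore surrounding punctuation.
        return word.lower().strip(".,;:!?\"'")

    matcher = SequenceMatcher(
        None,
        [norm(w) for w in manual_words],
        [norm(w) for w in asr_words],
        autojunk=False,
    )

    merged = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "insert":
            # Words present only in the speech-to-text transcript.
            merged.extend(f"[+{w}]" for w in asr_words[j1:j2])
        else:
            # "equal", "replace", "delete": trust the manual transcript.
            merged.extend(manual_words[i1:i2])
    return " ".join(merged)


if __name__ == "__main__":
    manual = "The witness stated that he was home"
    asr = "the witness stated that he was at home"
    print(merge_missing_words(manual, asr))
    # prints: The witness stated that he was [+at] home
```

Marking insertions rather than applying them silently seems safer for legal work, since speech-to-text output can itself contain errors.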