I need to convert speech to text, but my audio file contains two people talking to each other, and I need a separate transcript for each speaker. I think I first have to separate their voices from one another and then convert each voice to text, but I can't find any Python library that does this.
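
To make the goal concrete, here is a rough sketch of the result I'm after. It assumes two inputs I don't have yet: a list of speaker turns (who is talking from which second to which second) and a list of recognized words with timestamps. The `Turn` and `Word` classes, the function names, and the sample data are all made up for illustration and don't come from any real library.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A stretch of audio attributed to one speaker (hypothetical input)."""
    speaker: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Word:
    """One recognized word with its time span (hypothetical input)."""
    text: str
    start: float  # seconds
    end: float    # seconds


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_words(turns, words):
    """Give each word to the speaker whose turn overlaps it the most.

    Words that overlap no turn at all are dropped.
    Returns a dict mapping speaker label to that speaker's text.
    """
    per_speaker = {}
    for word in words:
        best = max(
            turns,
            key=lambda t: overlap(t.start, t.end, word.start, word.end),
            default=None,
        )
        if best is None:
            continue
        if overlap(best.start, best.end, word.start, word.end) == 0.0:
            continue
        per_speaker.setdefault(best.speaker, []).append(word.text)
    return {speaker: " ".join(texts) for speaker, texts in per_speaker.items()}


# Made-up sample data: speaker A talks for two seconds, then speaker B.
turns = [Turn("A", 0.0, 2.0), Turn("B", 2.0, 4.0)]
words = [
    Word("hello", 0.2, 0.6),
    Word("there", 0.7, 1.1),
    Word("hi", 2.3, 2.5),
]

result = assign_words(turns, words)
print(result)
```

What I'm missing is the part that produces the speaker turns and the timestamped words from the audio file in the first place.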