Could anyone recommend tools or libraries that support this functionality? The only approach I can think of so far is to use ControlNet with Stable Diffusion to generate the images and then combine them into a video. However, I’m open to other tools or libraries that might suit this requirement better.
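For reference, here is a rough sketch of the "combine them into a video" step only. It assumes the generated frames are already saved as numbered PNG files and that the `ffmpeg` command-line tool is installed. The file pattern, frame rate, output name, and the helper `build_ffmpeg_command` are illustrative assumptions, not something I have settled on.

```python
import shlex


def build_ffmpeg_command(frame_pattern, output_path, fps=8):
    """Build the ffmpeg argument list that encodes numbered frames into an MP4.

    frame_pattern is a printf-style pattern such as "frames/frame_%04d.png"
    (hypothetical naming scheme for the generated images).
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-framerate", str(fps),    # input frame rate for the image sequence
        "-i", frame_pattern,       # numbered input frames
        "-c:v", "libx264",         # H.264 video encoder
        "-pix_fmt", "yuv420p",     # pixel format most players can decode
        output_path,
    ]


cmd = build_ffmpeg_command("frames/frame_%04d.png", "out.mp4", fps=8)
print(shlex.join(cmd))  # show the command as a single shell-style string
# To actually encode (requires ffmpeg on PATH):
#   import subprocess; subprocess.run(cmd, check=True)
```

This only covers stitching frames together; it does not address keeping the generated frames consistent with each other, which is the part I am less sure about.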
I have looked into basic image-to-video conversion tools, but haven’t found many that explicitly support adding prompts or annotations in a flexible manner. Any suggestions, pointers to libraries, or even code examples would be greatly appreciated.
Thank you in advance for your help!