OpenAI has developed a new text-to-video model called "Sora". Soon anyone will be able to script, direct, and create their own films, their own cinematographic works.

lawrence@lemmy.world · 1 年前

OpenAI has developed a new text-to-video model called "Sora". Soon anyone will be able to script, direct, and create their own films, their own cinematographic works.

Adalast@lemmy.world · 1 年前

It wouldn’t be too hard to train. There are enough audio models and computer vision models that could be trained in parallel on video clips that have recorded sound to train what sound profiles are associated with what events in the frame.

The real fun one would be to figure out how to train an AI to understand sounds originating from out of frame.