Sora’s Game Content

OpenAI has not revealed the source of data it has used to train Sora, its video-generating AI released on December 9, 2024. It is speculated that at least some of the data might have been sourced from Twitch streams and walkthroughs of games.

Sora responds to a text prompt or image and generates up to 20-second-long videos in a range of aspect ratios and resolutions.

OpenAI first revealed Sora in February 2024 and referred to the fact the model has been trained on Minecraft Videos.

Sora can generate a fighter in the style of Ninja Turtle game of the 90s. It knows what a Twitch stream looks like and it means that it has seen a few. It features the likeness of Popular Twitch Streamer which goes by the name of Auronplay. It has generated a video character similar in appearance to Imane Anys, better known as Pokimane. OpenAI has used filtering to prevent Sora from generating clips of trade-marked characters. However, it seems game content has gone into training data.

In past, the previous CTO Mira Murati would not deny outright that Sora was trained on YouTube, Instagram and Facebook content. In the tech specs, OpenAI acknowledged it used ‘publicly available’ data along with licensed data from stock media libraries like Shutterstock to develop Sora.

Generative AI models like Sora are probabilistic, learning patterns in the data to make predictions. In the process, they may produce near-copies of their training examples. It offends the creators. There are legal issues. AI companies claim their outputs are transformative and not plagiaristic.

Play videos have three layers — unique video the player or videographer, contents of the game owned by the developer and there could be a third layer of user-generated content appearing in software. There are many protectable elements in the game. AI companies could prevail in the legal disputes since the output is transformative. However, it is tricky when the model regurgitates a copyrighted work.

AI companies have indemnity clauses to cover these situations. There are issues of fair use. They affect the video game industry.

print

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *