Scientists from Google Research have published a paper on GameNGen, an AI-based game engine that generates original Doom gameplay entirely on a neural network. Researchers Dani Valevski, Yaniv Leviathan, Moab Arar, and Shlomi Fruchter designed GameNGen around Stable Diffusion, which processes previous frames and the player's current input to generate the next frame with impressive visual fidelity and consistency.
Generating a complete game engine with consistent logic using AI is a unique achievement. GameNGen's Doom plays like a real video game, with turning, strafing, firing weapons, and accurate damage from enemies and environmental hazards. As you explore, the real levels are generated around you in real time. Even your pistol's ammo count is tracked with near-perfect accuracy. According to the paper, the game runs at 20 FPS and is hard to distinguish from real Doom gameplay in short clips.
To get all the training data GameNGen needed to accurately model Doom's levels, the Google team trained an agent AI to play Doom at every difficulty, simulating a range of player skill levels. Actions like collecting power-ups and completing levels were rewarded, while taking damage and dying were penalized. The result was an agent capable of playing Doom, providing the GameNGen model with hundreds of hours of visual training data to reference and recreate.
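The paper doesn't spell out the agent's exact reward function, but reward shaping of this kind generally looks something like the minimal sketch below. Every weight, state field, and the shaped_reward helper itself are hypothetical illustrations, not the paper's actual code.

```python
# A minimal sketch of reward shaping for a Doom-playing agent. All weights and
# state fields here are hypothetical, not taken from the GameNGen paper.
def shaped_reward(prev_state: dict, state: dict) -> float:
    reward = 0.0
    # Reward progress: power-ups collected and finishing the level.
    reward += 10.0 * (state["items_collected"] - prev_state["items_collected"])
    if state["level_complete"]:
        reward += 100.0
    # Penalize getting hurt and dying.
    reward -= 1.0 * max(0, prev_state["health"] - state["health"])
    if state["is_dead"]:
        reward -= 100.0
    return reward

# Example: the agent picked up one item but lost 20 health on this step.
print(shaped_reward(
    {"items_collected": 0, "health": 100},
    {"items_collected": 1, "health": 80, "level_complete": False, "is_dead": False},
))  # 10.0 - 20.0 = -10.0
```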
A key innovation in this work is how the scientists maintained coherence between frames over long periods of time while using Stable Diffusion. Stable Diffusion is a ubiquitous generative AI model that generates images from image or text prompts, and it has been used in animation projects since its release in 2022.
The two major weaknesses of Stable Diffusion in animation are a lack of frame-to-frame consistency and an eventual degradation of visual fidelity over time. As seen in Corridor's Anime Rock Paper Scissors short film, Stable Diffusion can create convincing still images, but it introduces a flickering effect when the model outputs successive frames (note how a shadow appears to jump across the actor's face with each frame).
Flickering can be mitigated by feeding Stable Diffusion's output back into the model and training it on the images it creates, which helps successive frames match up with one another. However, after a few hundred frames, the image generation becomes less accurate and resembles a copy of a copy made too many times.
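To see why this "copy of a copy" drift happens, here's a toy, self-contained simulation (not GameNGen's code): each "generated" frame is modeled as the previous frame plus a small random error, standing in for an imperfect reconstruction, and those errors compound over time.

```python
import numpy as np

# Toy illustration of autoregressive drift: each frame is the previous frame
# plus a small random error, so mistakes accumulate with every step.
rng = np.random.default_rng(0)
frame = np.zeros((64, 64))  # stand-in for the first real frame

errors = []
for step in range(300):
    # The model's output becomes the next input, carrying its mistakes along.
    frame = frame + rng.normal(scale=0.01, size=frame.shape)
    errors.append(np.abs(frame).mean())

print(f"mean drift after 10 frames:  {errors[9]:.3f}")
print(f"mean drift after 300 frames: {errors[299]:.3f}")
```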
Google Research solved this problem by conditioning each new frame on a longer series of user inputs and previous frames rather than a single prompt image, and by corrupting those context frames with Gaussian noise during training. A separate but connected neural network then corrects the context frames, so the output images are constantly self-correcting and a high level of visual stability is maintained over time.
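Conceptually, the noise-augmentation step looks something like the PyTorch sketch below. The tensor shapes, the corrupt_context helper, and the noise range are assumptions for illustration, not the paper's actual implementation.

```python
import torch

# A minimal sketch of Gaussian noise augmentation on context frames, assuming
# a diffusion model conditioned on past frames. Shapes and the noise range are
# hypothetical; the paper's implementation differs in detail.
def corrupt_context(context_frames: torch.Tensor, max_noise_level: float = 0.7):
    """Add a random amount of Gaussian noise to each training example's context.

    context_frames: (batch, num_context, channels, height, width)
    Returns the corrupted frames plus the per-sample noise level, which is
    also fed to the model so it can learn to undo the corruption.
    """
    batch = context_frames.shape[0]
    # Sample one noise level per training example.
    noise_level = torch.rand(batch, device=context_frames.device) * max_noise_level
    sigma = noise_level.view(batch, 1, 1, 1, 1)
    noisy = context_frames + sigma * torch.randn_like(context_frames)
    return noisy, noise_level

# During training, the model sees corrupted history but must still predict a
# clean next frame, so at inference it tolerates its own imperfect outputs.
noisy_ctx, levels = corrupt_context(torch.zeros(4, 8, 3, 64, 64))
print(noisy_ctx.shape, levels.shape)  # torch.Size([4, 8, 3, 64, 64]) torch.Size([4])
```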
A pair of time-lapse images of GameNGen generating Doom without (top) and with (bottom) Gaussian noise corruption and correction of its own reference frames. (Images by Dani Valevski et al.)
The GameNGen examples we've seen so far are certainly not perfect. Specks and blurry objects appear randomly on the screen. Enemies turn into blurry blobs when they die. Doomguy on the HUD is constantly moving his eyebrows up and down, like The Rock on Monday Night Raw. And, of course, the generated levels are inconsistent at best. The YouTube video embedded above ends with Doomguy suddenly no longer taking damage at 4% health, then spinning 360 degrees in a poison pit and completely changing position.
While the result isn't a polished video game, GameNGen does produce a neat imitation of the Doom we all love. Somewhere between a tech demo and a thought experiment for the future of AI, Google's GameNGen could become an important part of AI game development if the field continues to advance. Combined with Caltech's research using Minecraft to teach AI models to generate maps consistently, AI-based video game engines may be coming to a computer near you sooner than we thought.