Get ready for a blast from the past, powered by the future! Microsoft just dropped something incredible for tech enthusiasts and gamers alike: an AI-generated Quake II demo that you can actually play in real-time.
Imagine interacting with a game world not run by traditional code, but simulated entirely by an Artificial Intelligence. That’s exactly what Microsoft’s research team has achieved with their latest AI model, and they’re letting you experience it firsthand.
This isn’t just a video; it’s an interactive glimpse into how AI could reshape game development and virtual environments. Let’s dive into what this means and how it works.
Table of contents
- What is WHAMM? Microsoft’s Leap into Real-Time AI Gameplay
- How WHAMM Brings Quake II to Life with AI
- Key Improvements: What Makes WHAMM Different?
- Under the Hood: The Technology Behind WHAMM
- Playing the Model: What’s the Quake II AI Experience Like?
- Understanding the Limitations: It’s AI, Not Magic (Yet!)
- The Bigger Picture: What Does WHAMM Mean for AI and Gaming?
- Try It Yourself!
What is WHAMM? Microsoft’s Leap into Real-Time AI Gameplay
The magic behind this playable demo is a new AI model nicknamed WHAMM. It stands for World and Human Action MaskGIT Model (yes, the team admits the name is intentionally a bit silly!).

WHAMM is part of Microsoft’s larger Muse family of “world models” designed for video games. Think of world models as AI systems that learn the underlying rules and physics of an environment, allowing them to predict what happens next or even simulate the environment itself.

WHAMM builds upon previous work like the WHAM model (World and Human Action Model). But the key difference? Speed. WHAMM is designed for real-time interaction.
How WHAMM Brings Quake II to Life with AI
The core breakthrough with WHAMM is its ability to generate game visuals incredibly fast. While its predecessor might generate one image per second, WHAMM can churn out over 10 frames per second.
This speed is crucial. It means when you press a key on your keyboard or controller (like moving forward, jumping, or shooting), the AI model generates the next visual frame almost instantly. You see the consequence of your action right away, creating a playable experience inside the AI model.
Microsoft has made this interactive demo available through Copilot Labs, allowing anyone to step into this AI-generated slice of Quake II.
Key Improvements: What Makes WHAMM Different?
The team behind WHAMM has made significant strides since their earlier models. Here’s what’s changed:
Blazing Fast Speed: From 1 FPS to 10+ FPS
As mentioned, the leap in generation speed is the headline feature. Moving from roughly 1 frame per second (FPS) with WHAM to over 10 FPS with WHAMM turns a slow prediction process into a fluid, real-time experience. This is what makes the Quake II demo playable.
New Game, New Challenge: Adapting to Quake II
The team successfully applied the WHAMM approach to a completely different game: Quake II. Unlike their previous work on the game Bleeding Edge, Quake II is a much faster-paced first-person shooter with distinct mechanics. Proving the technique works on different genres is a big step.
Smarter Training: Less Data, More Focus
Perhaps one of the most impressive feats is the reduction in training data needed. While the earlier WHAM model used a staggering seven years worth of gameplay footage, WHAMM was trained on just one week of data!
How? By being much more intentional. They worked with professional game testers to capture high-quality, diverse gameplay focused specifically on a single level of Quake II. This targeted approach dramatically reduced data requirements.
Sharper Visuals: Doubling the Resolution
WHAMM outputs visuals at 640×360 resolution, a significant step up from the 300×180 used by the previous WHAM model. The team achieved this with relatively minor tweaks to how the AI processes images, resulting in a much clearer and more immersive perceived quality.
Under the Hood: The Technology Behind WHAMM
So, how does WHAMM achieve this real-time generation? It involves a shift in AI architecture.
From WHAM to WHAMM: The Architectural Shift
The previous WHAM model worked somewhat like large language models (LLMs) like ChatGPT. It generated the data for the next image one piece (or “token”) at a time. Generating hundreds of tokens sequentially takes time.
WHAMM uses a different technique inspired by MaskGIT. Instead of generating token by token, MaskGIT allows the model to generate all the tokens for an image in parallel, through multiple refinement steps. This parallel approach is much faster.
The Two-Stage Process: Backbone and Refinement
To optimize for speed, WHAMM uses a clever two-stage system:
- Backbone Transformer: A larger AI module (~500 million parameters) looks at the recent history of gameplay (previous images and actions) and makes a quick, initial prediction for all the pieces of the next image frame.
- Refinement Transformer: A smaller, faster module (~250 million parameters) takes this initial prediction and iteratively refines it using the MaskGIT process. It repeatedly masks out parts of the predicted image and re-predicts them, cleaning up the details rapidly.
This division of labor allows WHAMM to generate a reasonably detailed frame very quickly, enabling that real-time feel.
Playing the Model: What’s the Quake II AI Experience Like?
Jumping into the WHAMM Quake II demo is a unique experience. You can explore the environment, move the camera, jump, crouch, and even shoot. Explosive barrels react somewhat like they do in the original game.
Because the AI learned from gameplay data that included finding secrets, you might even stumble upon hidden areas within the level discovered entirely by the AI!
It’s important to remember you’re “playing the model,” not a perfect recreation of the original Quake II. The AI is simulating the world based on what it learned, leading to some interesting quirks and limitations.
Understanding the Limitations: It’s AI, Not Magic (Yet!)
While impressive, WHAMM is still a research project and has limitations:
- Generative Approximation, Not Replication: The AI generates an approximation of the Quake II environment. It’s not running the original game code, so expect differences and inconsistencies.
- Fuzzy Enemies and Combat Quirks: Interactions with enemies can be unreliable. They might appear blurry, and the logic for dealing or taking damage isn’t always accurate.
- The Short Memory: Context Length Issues: The model currently only considers the last 0.9 seconds of gameplay. If something (like an enemy) leaves your view for longer than that, the model might forget it existed. This can lead to funny moments, like enemies disappearing if you look away briefly.
- Counting Challenges (Health): The AI isn’t great at precise counting, so the health display might not always be accurate, affecting interactions with health packs.
- Limited Scope (Single Level): The current demo is trained only on a specific part of the first level of Quake II. Reaching the end of that section will cause the generation to freeze.
- Latency Considerations: While faster, making the demo widely available online introduces some noticeable delay (latency) between your input and the AI’s response compared to the researchers’ internal versions.
The Bigger Picture: What Does WHAMM Mean for AI and Gaming?
Microsoft’s work on WHAMM and Muse represents a fascinating direction for both AI research and the future of interactive entertainment.
These “world models” could potentially:
- Accelerate game development by having AI generate environments or simulate complex interactions.
- Create entirely new types of dynamic and adaptive game worlds.
- Power more realistic simulations for training or research.
- Offer new ways for players to interact with and even modify game environments on the fly.
While still early days, the playable Quake II demo is a tangible proof-of-concept showing the power of generative AI applied to complex, interactive domains.
Try It Yourself!
Curious to experience this AI simulation firsthand? Microsoft has made the AI-generated Quake II demo available to the public.
Head over to the Microsoft Copilot Labs page to give it a try.
Remember, you’re stepping into a research experiment, a chance to play inside an AI model. It’s a fun, sometimes quirky, but undoubtedly exciting glimpse into what the future might hold for AI and gaming. Go check it out!
| Latest From Us
- Forget Towers: Verizon and AST SpaceMobile Are Launching Cellular Service From Space

- This $1,600 Graphics Card Can Now Run $30,000 AI Models, Thanks to Huawei

- The Global AI Safety Train Leaves the Station: Is the U.S. Already Too Late?

- The AI Breakthrough That Solves Sparse Data: Meet the Interpolating Neural Network

- The AI Advantage: Why Defenders Must Adopt Claude to Secure Digital Infrastructure


