Skip to content

December 4, 2025 / 9 min read

Before AI, It Was Just Graphics

NVIDIA looked like a gaming company until the boring infrastructure layer it had been building became the strategic layer for rendering, simulation, production, and AI.

I watched Jensen Huang sit down with Joe Rogan, and the easy version of the story is the one everybody already knows: NVIDIA made graphics cards, AI got huge, and suddenly the graphics card company became the obvious hardware name in the room.

That version is true enough to be familiar and too thin to be useful.

The more interesting story is not founder mythology. It is infrastructure. NVIDIA looked like a gaming company because gaming was the first mass market that needed what NVIDIA was really building: cheap parallel computation, real-time image generation, developer tooling, and a platform that made specialized hardware useful to normal people.

The important layer is often boring before it becomes obvious. It looks like drivers. APIs. Compilers. Developer relations. Memory bandwidth. Thermal envelopes. Chip packaging. Render pipelines. Software libraries. Annoying technical decisions that only a niche cares about until the niche becomes the future.

AI did not make NVIDIA important out of nowhere. AI revealed what NVIDIA had already been accumulating.

It was never just games

For a long time, “graphics card” sounded like a gamer purchase. It was the thing you bought so Quake, Doom, Unreal, flight simulators, and later every other visually ambitious thing could run better.

That made it easy to underestimate.

But graphics has never really been trivial. Real-time graphics is a brutal computing problem disguised as entertainment. You are trying to generate convincing reality many times per second, under hard latency constraints, on a machine somebody can fit under a desk. Every pixel is a small math problem. Every frame is a deadline. Lighting, geometry, textures, shadows, simulation, physics, animation, and camera movement all have to resolve fast enough that the illusion does not fall apart.

That is not “just games.” That is consumer supercomputing with a dragon on the box.

In the Rogan conversation, Huang keeps returning to this idea: computer graphics was an early application for a different style of computation. NVIDIA’s CUDA guide describes the GPU as starting out as hardware for parallel operations in real-time 3D rendering, then becoming more programmable over time. CPUs are built around fast serial work. GPUs are built around doing huge numbers of related operations in parallel.

That distinction sounds technical because it is. It is also a business distinction hiding inside the hardware.

The first killer app was visual. Make the screen better. Make the game faster. Make the simulation feel real. Make the world render.

The long bet was that this kind of computation would matter far beyond the screen.

Infrastructure wins quietly

Infrastructure usually wins quietly before it wins loudly.

A road is boring until the city grows around it. A port is boring until trade depends on it. A file format is boring until every department needs to exchange work through it. A render farm is boring until the movie cannot ship without it.

NVIDIA’s bet belongs in that category. The company was founded in 1993 around 3D graphics for gaming and multimedia. The visible product was the card. The deeper product was an ecosystem: hardware, software, drivers, developer tools, game relationships, and eventually a programming model that let people treat the GPU as something more general than a graphics device.

That is the move. The hardware matters, obviously. But the hardware is not the whole moat. A chip without a software stack is a component. A chip with libraries, tooling, developer habits, training material, examples, installed base, and years of weird edge cases already solved becomes infrastructure.

This is why CUDA matters. NVIDIA introduced CUDA in the mid-2000s, giving developers a way to use GPU throughput outside graphics APIs. The same basic hardware originally justified by games and graphics could now be used for scientific simulations, image classification, databases, analytics, machine learning, and eventually generative AI.

That did not look obvious at first. Near the end of the Rogan conversation, Huang says NVIDIA added CUDA before customers were asking for it, increasing cost before the market knew how to value it. That is the uncomfortable part of a long bet: for a while, the spreadsheet makes you look wrong.

This is where I think the NVIDIA story is more useful if we strip out the hero lighting. The point is not “one genius saw the future.” The point is that a company spent years building a technical layer before the mainstream use case arrived.

That is rarer, and more instructive.

Timing, not destiny

The clean narrative says NVIDIA built the GPU, deep learning arrived, and the plan worked.

The real narrative is messier.

Neural networks were not new. AI had already spent decades cycling between promise and disappointment. Then computer vision broke open. In the 2012 AlexNet paper, Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton describe training a deep convolutional network on two NVIDIA GTX 580 GPUs, a result that became one of the canonical moments in modern deep learning.

That detail matters because it is almost too perfect: the AI breakthrough that helped set off the current era ran on graphics cards.

Not a sacred machine in a government lab. Not a bespoke AI appliance from the future. Graphics cards.

The tools built to make games faster turned out to be useful for teaching machines to see. The work done for rendering, pixels, shaders, and parallel math became relevant to neural networks because neural networks also thrive on large amounts of parallelizable computation.

That is timing. But it is not pure luck.

Luck is AlexNet showing up when it did. Luck is researchers finding the hardware. Luck is the world finally having enough data, compute, and technique in the same place.

The non-luck part is that NVIDIA had already spent years making the hardware programmable, available, and supported. When the moment arrived, there was something for the moment to land on.

That is what infrastructure does. It waits.

Why production people should care

Here is the Bill angle: if you come from VFX, production, post, photography, games, virtual production, or creative technology, this story should feel familiar.

You already know that the unglamorous layer is usually where the leverage lives.

The audience sees the shot. The producer sees the schedule. The supervisor sees the pipeline. The artist sees the comp. The render wrangler sees the farm. The editor sees the storage problem. The colorist sees the monitor chain. The person responsible for delivery sees the codec, the file naming, the conforms, the missing plates, the bad metadata, the overnight render that failed at 4:12 a.m.

Infrastructure is where creative ambition either becomes possible or turns into a meeting.

That is why NVIDIA’s story is not just a Wall Street story or an AI story. It is a production story.

Modern production is already a stack of specialized compute. VFX depends on rendering, simulation, compositing, color pipelines, asset management, and increasingly real-time engines. Virtual production workflows moved game-engine technology into previs, pitchvis, techvis, postvis, live compositing, in-camera VFX, virtual scouting, and earlier design review. AI is now sliding into the same stack: search the archive, generate references, build temp boards, rough out shots, denoise plates, assist roto, synthesize voices, summarize footage, organize notes, test looks, and accelerate the annoying connective tissue around the work.

The common thread is not “AI replaces artists.” That is the shallow argument.

The common thread is that the pipeline keeps moving toward more programmable reality.

First the image became computable. Then the scene became computable. Then the camera, lighting, physics, materials, motion, and edit decisions became increasingly computable. Now language, reference, planning, coding, search, and visual ideation are becoming part of the same computational workspace.

That does not remove taste. It raises the price of taste.

When the tool makes rough output cheap, the valuable person is not the one who can generate the most options. The valuable person is the one who understands what should survive contact with the production.

VFX people should understand NVIDIA because they have been living inside this transition longer than most people. They know that a faster render does not automatically make a better shot. A real-time preview does not eliminate judgment. A beautiful synthetic frame can still be useless if it cannot be approved, tracked, licensed, matched, rendered, versioned, or delivered.

The tool is never the whole system.

The platform was the product

One of the least interesting ways to talk about NVIDIA is to obsess over any single chip as if the story is just benchmark theater.

The better question is what becomes possible once enough people build around the same toolchain. That is what happened with games. Developers optimized for the hardware. Engines absorbed the assumptions. Players bought the cards. The cards justified better games. Better games justified better cards. A market became a loop. Huang describes this as turning a technology business into a platform business. That is the important sentence.

The same pattern shows up again in AI. Researchers write against the libraries. Frameworks support the hardware. Cloud providers buy the systems. Startups assume the stack. Engineers learn the tools. Models are designed around the available compute.

At that point, infrastructure is not passive anymore. It starts shaping the field.

This is why the boring layer becomes strategic. The API influences what gets built. The engine influences the shot. The render budget influences the creative. The available compute influences the model. The energy constraint influences the geography.

People like to imagine technology arriving as a clean new thing. More often, it arrives as a dependency.

The factory layer

The deeper NVIDIA goes into AI, the less the story is about a chip in isolation.

It becomes about fabs, packaging, memory, interconnects, data centers, cooling, power, supply chains, and export controls. It also becomes about customers trying to turn capital into capability before the next model generation changes the target.

That is a very different story from “graphics card company gets hot.”

AI at scale is industrial infrastructure. It needs land, power, water, network capacity, specialized labor, long-range planning, and giant purchasing commitments. The International Energy Agency estimates data centers used about 415 TWh of electricity in 2024 and projects that demand could roughly double by 2030, with AI accelerating deployment of high-performance servers. The broader numbers back that up. The limiting factor is not just whether somebody can imagine a bigger model. It is whether somebody can power, cool, connect, finance, and operate the machine that trains or serves it.

Again, production people should recognize this.

The creative question is always attached to the physical one. Can we make the shot? Can we render it by Friday? Can we store the plates? Can we move the data? Can the stage handle it? Can the crew turn it around? Can the budget absorb the change?

AI did not escape those questions. It made them larger.

Less myth, more pattern

Jensen Huang is impressive. You can say that without turning the piece into incense.

The founder version of the NVIDIA story is tempting because it gives the narrative a face. It is easier to talk about a leather jacket than a software ecosystem, or charisma instead of memory bandwidth.

But the more useful pattern is bigger than one person.

Some technologies look narrow because the first market is narrow. Some tools look like toys because the first users are playing. Not every niche tool becomes world historical. Most stay niche. Some disappear. Some deserve to disappear. But occasionally, the niche is where the future is rehearsing.

Graphics looked like games. Games required real-time worlds. Real-time worlds required parallel computation. Those habits turned out to be useful for simulation, rendering, scientific computing, machine learning, and AI.

The line from Quake to ChatGPT is not straight. It is not destiny. But it is real enough to make you reconsider what “just a tool” means.

What to watch next

I am watching the tools that look too specialized for mainstream people to care about: real-time engines used earlier in design, local AI tools for private footage search, simulation systems for synthetic training data, render pipelines that collapse previs and review, and agents that handle the boring glue between files, calendars, notes, transcripts, shot lists, asset databases, and delivery specs.

Some of it will be hype. Some of it will be embarrassing. Some of it will waste money in exactly the ways new tools always waste money.

But some of it will become infrastructure.

The future rarely arrives already looking like the future. It usually arrives as something too technical, too expensive, too niche, too weird, or too easy to dismiss. A toy. A plugin. A graphics card. A game engine. A render setting. A command line tool. A thing only the obsessive people care about.

Then timing changes.

The boring layer becomes the strategic layer.

And everybody suddenly pretends it was obvious.