͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

Forwarded this email? Subscribe here for more

All Compute Must Flow

“If I don’t have that kind of compute on Day 1, I can’t breathe."

Tina He

Mar 2

READ IN APP

The future of AI innovation faces a quiet but critical constraint. As major players secure GPU resources through exclusive contracts, promising emerging experiments struggle to access the compute they need. This growing concentration of essential infrastructure threatens to narrow the field of AI development to a select few.

In crypto, there’s a saying: “Value must flow.” Inspired by this principle, we must ask: What would it take for compute to flow freely? How can we build fluid GPU marketplaces ensuring resources immediately and fairly reach their highest-value uses?

Blockchain systems already solved a similar challenge: dynamically allocating limited processing power amid fluctuating demand. AI markets can leverage this insight, creating a true market infrastructure currently missing in the compute ecosystem.

Why now? Compute spending is projected to surpass global oil expenditure within five years. Yet unlike oil markets, AI compute lacks mature market infrastructure, trapping resources in rigid contracts, unavailable precisely when needed.

To be clear, large labs like OpenAI, Anthropic, and Meta naturally prioritize exclusive GPU access for internal workloads, but this exclusivity often leaves resources idle. Meanwhile, vast GPU power lies outside these labs in data centers and local machines worldwide. A fluid marketplace incentivizes both large labs and distributed GPU providers to monetize idle resources, unlocking supply from the long tail.

A dynamic marketplace for GPU resources would better serve smaller AI companies than the current system of rigid contracts and allocations. Building on Steve Ruiz's “sign in with AI” concept and blockchain prioritization methods, we could create a flexible system that more efficiently matches compute supply with demand.

“It’s oxygen.”

Steve Ruiz envisions a future where hundreds of specialized AI applications emerge to serve diverse needs. Instead, the GPU market's structure throttles this potential at its source.

Data from a16z's Oxygen program reveals a stark reality: startups that managed to secure GPU access show 3-4x faster development cycles than those forced to wait. Young companies face an impossible choice: spend millions upfront on compute, or waste precious engineering resources building complex tracking systems.

A recent case exemplifies the problem. As Anjney Midha explains, "Last summer, we had a portfolio company who had a signed contract for delivery of a set amount of GPUs, and at the last minute was told they would need to wait another three months. It turns out a bigger customer had come in and offered 3x more than the startup had agreed to pay." The implications for startups are severe: "The reality of market forces is that, as long as there are bigger customers who get better treatment because they're buying in bulk, there will always be a need for us to help our companies be treated that way."

"Computing power isn't just a resource for us—it's oxygen," confided one founder, who requested anonymity due to ongoing negotiations with providers. "Without day-one access, we might as well not exist."

Inspirations from Ethereum

To truly enable compute to flow, we need two complementary layers: a decentralized marketplace and a dynamic allocation system. The marketplace allows GPU providers and AI developers to freely discover and transact, unlocking trapped capacity and expanding access. The allocation layer, inspired by blockchain fee markets, dynamically directs resources to their highest-value use in real-time.

Blockchain platforms like Ethereum have tackled a similar challenge: distributing scarce compute resources fairly. Consider these parallels for AI markets:

Dynamic priority management: Ethereum transactions use priority fees to transparently reflect two key factors: urgency (how quickly a task needs execution, expressed by the user's willingness to pay a premium) and congestion (the real-time demand for scarce computing resources). Tasks compete in a queue ("mempool"), with higher bids reflecting higher urgency or greater congestion. Similarly, AI workloads could dynamically bid for GPU access, adjusting priorities in real-time based on urgency and available resources, ensuring efficient, market-driven allocation exactly when it matters most.
Fair revenue sharing: On blockchain, special mechanisms ensure the extra value from high-priority transactions is fairly shared across the system. Likewise, additional revenue from high-priority GPU jobs could be distributed fairly among those providing resources and the developers who create AI apps.
Execution guarantees: Ethereum ensures transactions either complete fully or safely revert if something goes wrong. AI tasks could use similar methods, guaranteeing that no resources or money are wasted if a GPU job is interrupted or fails.

Blockchain-inspired dynamic pricing offers transparent, granular, real-time resource allocation beyond static cloud models, in real-time matching supply with fluctuating demand. To maintain equitable access, complementary measures—such as reserved capacities for nonprofits, students, and creators—can support resource-constrained participants.

While existing cloud providers like AWS or Azure implicitly price urgency (spot vs. reserved instances), these systems are relatively static and lack real-time, transparent responsiveness. Blockchain-inspired priority markets offer a more granular, dynamic mechanism—continuously adjusting allocations in real-time, directly matching supply to fluctuating demand.

Beyond urgency, AI workloads often vary in memory requirements, hardware specialization, and reliability needs. A multi-dimensional pricing model can further refine allocations by allowing developers to pay specifically for their required resources while enabling providers to allocate capacity efficiently and accurately.

A concrete example in film-making

[SDFX] - Studio Grade New Comfy UI - (Free + Opensource) — Source: ComfyUI Subreddit

Imagine you're an independent filmmaker who just landed funding from A24 and urgently needs GPUs to render complex animations for an upcoming film festival. Today's rigid GPU scheduling forces your high-stakes tasks into a queue behind bulk workloads from large studios, causing frustrating delays.

A dynamic priority marketplace solves this instantly, with your task automatically signaling its urgency through a priority fee, ensuring timely GPU allocation exactly when needed. To ensure fairness, especially for resource-constrained creators, the platform can provide subsidies or reserved capacity for artists, students, or nonprofits, balancing equity and efficiency.

This design thoughtfully blends simplicity, fairness, and real-time responsiveness, providing timely access precisely when creators need it.

From commodity to market infrastructure

The current GPU market faces a classic innovator's dilemma. Incumbents like NVIDIA profit from artificial scarcity and rigid, long-term contracts, while cloud providers optimize for predictable workloads rather than dynamic user needs. This rigidity creates a window for disruption:

Cloud providers can differentiate themselves by adopting flexible GPU allocation, similar to how AWS reshaped hosting with dynamically priced Spot Instances. Someone like Together.ai is in a good position to integrate this piece of the stack vertically.
Emerging compute marketplaces like Compute Exchange can embed priority-based resource allocation directly, enabling immediate GPU access driven by real-time market signals.
Vertical-specific providers could thrive by precisely meeting the urgent compute demands of fields like generative media, scientific computing, or financial modeling—markets underserved by today's rigid structures.

Historically, early attempts at creating "compute futures markets" fell short, simply due to limited demand scale. Aaron Brown notes, "semiconductors in 1989 were a quarter of today's AI demand." But now, with AI’s explosive growth, a priority-based GPU marketplace is finally viable, enabling sophisticated financial instruments: spot markets for immediate compute, futures for predictable demand, and credit for resource borrowing during spikes.

First movers who implement these flexible allocation mechanisms will become essential infrastructure providers for the AI economy. Just as oil futures transformed energy markets, compute futures today offer AI developers financial tools that match and amplify their technical ambitions, an opportunity incumbents may be too slow to capture, leaving room for emerging startups.

What would it take to make this real?

A demo I made to illustraterate the idea

Equally critical is establishing a clear, verifiable standard for GPU resources, ensuring buyers know exactly what they're purchasing. Just as commodity markets validate oil quality, GPU markets can use standardized benchmarks or independent certifications to verify that purchased compute meets promised specifications. This trust in resource authenticity and quality will underpin the reliability of the entire marketplace.

The market would evolve naturally from high-value, time-sensitive applications like financial markets, where the value of priority is most evident. As more providers integrate and developers build on the protocol, we'd see the emergence of sophisticated financial instruments—futures, options, and compute-backed lending. Each phase increases efficiency and flexibility for emerging AI applications.

The challenge for such a system to work is immense. Bootstrapping a new market protocol requires coordinating countless stakeholders, each with their own incentives and constraints. But if computing truly becomes the backbone of our economy, building this infrastructure may be among the most consequential work of our generation.

Addressing common objections

There are a couple of common objections to the idea.

“Does blockchain-inspired prioritization truly suit AI’s complexity?” The goal isn't to copy blockchain methods directly, but to learn from their strengths in transparent, real-time allocation. Practical solutions may combine blockchain-inspired insights with more traditional market and institutional frameworks, balancing complexity and efficiency.

"AI workloads require stability—not volatile markets." AI tasks are longer-running and more complex than blockchain transactions, often demanding predictable access. A practical hybrid approach would let providers like AWS or Azure offer stable, reserved GPU capacity for predictable workloads alongside dynamic, priority-based queues for urgent tasks, balancing stability with real-time flexibility.

"Won’t wealthy incumbents dominate anyway?" Priority fees might favor well-funded players. Mitigations like dedicated resource pools or compute grants for smaller entities and research initiatives can preserve fairness and diversity.

"Falling costs will solve this" Hardware improvements like Blackwell (2.5x H100 performance) actually increase the need for flexible allocation. As Midha notes, "companies with H100 commitments are nervous" about being locked into older tech. Meanwhile, inference demands remain unpredictable, making fixed contracts inefficient.

Unleashing the long tail of AI creativity

Chris Paik's observation that "all value creation occurs through the reduction of friction" points to something far larger than compute allocation. The outlined priority mechanism could revolutionize how we distribute and value any scarce digital resource.

Think the emerging market for AI agents. Today, specialized AI assistants sit idle while others are overwhelmed with requests. A priority-based allocation system could create fluid markets for agent attention, automatically routing tasks to the most suitable agents based on urgency and expertise.

The applications extend to content discovery and relevance ranking. Rather than relying on static algorithms, priority signals could create dynamic marketplaces for attention.

Most intriguingly, priority-based mechanisms could enhance price discovery beyond traditional market approaches. While existing priority mechanisms (such as auctions or spot markets) assume rational participants and stable information, dynamic priority fees uniquely embrace the realities of rapidly shifting urgency, imperfect information, and volatile demand. They allow “markets to breathe," continuously adjusting allocations to match real-time human (and agent) needs and conditions.

The future diversity of AI innovation hinges on how fairly we distribute compute resources today. Ensuring compute flows isn't merely a technical challenge, it's about creating conditions for a diverse playground for experimentation.

And that’s the message: all compute must flow.

I first wrote about “digital-native commodities” in 2023 and the idea has evolved with the work I did at Station and Base. This thought exercise is just the beginning of a broader conversation about how we allocate the resources powering our shared future. While I’ve outlined one possible approach centered on priority mechanisms, there are many other innovative solutions to consider.

Here are some open questions I’m still exploring:

What alternative allocation mechanisms or market designs deserve more exploration?
If you're building an AI product today, how would dynamic priority allocation affect your process of innovation?
Blockchain technology has unique strengths for this use case, but is it truly the optimal solution, or are there simpler or more effective alternatives?
For infrastructure experts and market designers: What unintended consequences or practical challenges might arise when applying financial market mechanisms to compute resources?

These questions—and many others—will be topics I continue to investigate. If any of these resonate, or if you have insights to share, feel free to reply to this letter or drop me a note.

Special thanks to Robert Miller for the edits and review.

Further readings

The Scramble for AI Computing Power By Samuel Hammond
Concerns grow over the global AI compute divide by Harry Fowle
AI and Compute by OpenAI
Amid an A.I. Chip Shortage, the GPU Rental Market Is Booming by David A. Bader
Commodification of Compute by Ghosh et al., 2024
Compute Exchange announcement
Dynamic Pricing for Non-fungible Resources by Dimandis et al., 2024

Fakepixels is free today. But if you enjoyed this post, you can tell Fakepixels that their writing is valuable by pledging a future subscription. You won't be charged unless they enable payments.

Fakepixels - All Compute Must Flow

All Compute Must Flow

“If I don’t have that kind of compute on Day 1, I can’t breathe."

“It’s oxygen.”

Inspirations from Ethereum

A concrete example in film-making

From commodity to market infrastructure

What would it take to make this real?

Addressing common objections

Unleashing the long tail of AI creativity

Further readings

Older messages

How to build an agent

[FKPXLS] The New Frontier of Belonging

[FKPXLS] The Illusions of Free-to-Play

[FKPXLS] Brave New Decade

[FKPXLS] SPECIAL VOLUME: Embedded Education

You Might Also Like

180 / Make your everyday browsing ridiculously beautiful

Accessibility Weekly #438: When to Use Lists

High touch recruiting

🐺 Is a trade show is your right next step?

AD100 Designers on Battling Burnout

Accessibility Weekly #436: Evaluating Overlay-adjacent Accessibility Products

#495: Accessibility and Inclusive UX

AD Editors Share Their Favorite March Issue Moments

🐺 Did you know about this?