What Happens When AI Performance Asymptotes?
Tomasz TunguzVenture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. What Happens When AI Performance Asymptotes?
In the past, the bigger the AI model, the better the performance. Across OpenAI’s models for example, parameters have grown by 1000x+ & performance has nearly tripled.
But model performance will soon asymptote - at least on this metric. This is a chart of many recent AI models’ performance according to a broadly accepted benchmark called MMLU. 1 MMLU measures the performance of an AI model compared to a high school student. I’ve categorized the models this way :
Over time, the performance is converging rapidly both across model sizes & across the model vendors. What happens when Facebook’s open-source model & Google’s closed-source model that powers Google.com & OpenAI’s models that power ChatGPT all work equally well? Computer scientists have been challenged distinguishing the relative performance of these models with many different tests. Users will be hard-pressed to do better. At that point, the value in the model layer should collapse. If a freely available open-source model is just as good as a paid one, why not use the free one? And if a smaller, less expensive to operate open-source model is nearly as good, why not use that one? The rapid growth of AI has fueled a surge of interest in the models themselves. But pretty quickly, the infrastructure layer should commoditize, just as it did in the cloud where three vendors command 65% market share : Amazon Web Services, Azure, & Google Cloud Platform. The applications & the developer tooling around the massive AI commodity brokers is the next phase of development - where product differentiation & distribution differentiate rather than brilliant, raw technical advances.2 1 MMLU measures 57 different tasks including math, history, computer science & other topics. It’s one measure of many & it’s not perfect - like any benchmark. There are others including the Elo system. Here’s an overview of the differences.. Each benchmark grades the model on a different spectrum : bias, mathematical reasoning are two other examples. |
Older messages
What If LLMs Change the Business Model of the Internet?
Wednesday, February 28, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. What If LLMs Change the Business Model of the Internet? Last
The Secrets to Building Vibrant Communities in Web3 Open-Source
Tuesday, February 27, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Secrets to Building Vibrant Communities in Web3 Open-Source
The First $100m ARR AI Security Company
Thursday, February 22, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The First $100m ARR AI Security Company Palo Alto Networks,
Nobody Knows : Steel & Blockchains
Tuesday, February 20, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Nobody Knows : Steel & Blockchains Asking “What problems
Building a $20b Behemoth : Office Hours with Steven Goldfeder of Offchain Labs
Monday, February 19, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Building a $20b Behemoth : Office Hours with Steven Goldfeder
You Might Also Like
What’s 🔥 in Enterprise IT/VC #425
Saturday, December 21, 2024
Will business applications collapse in the agent era? Where does logic go + what are the "second order" effects when dozens or hundreds of agents are running amok at enterprises? ͏ ͏ ͏ ͏ ͏ ͏
🚀 The 2024 Space Stock Rally
Friday, December 20, 2024
Plus Synspective IPO's in Tokyo, $HON exploring strategic alternatives for Aerospace business, and more! The latest space investing news and updates. View this email in your browser The Space Scoop
🚨 Announcing: The inaugural “What’s in your stack?” survey
Friday, December 20, 2024
Tracking the most commonly used (and beloved) tools in tech ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
2025 Predictions
Friday, December 20, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. 2025 Predictions Every year I make a list of predictions &
🗞 What's New: What OpenAI's new o3 model means for indie hackers
Friday, December 20, 2024
Also: AI-powered reading is coming ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
🚨 Our final email. Don’t miss the insights that could change your 2025
Friday, December 20, 2024
24 hours to catch the best moments of the ecommerce summit 2025! Hey Friend , This is it—our final email about this. The full replay of the Start Your Ecommerce Business Summit 2025 will be gone in 24
How 5 predictions for social media in 2024 played out
Friday, December 20, 2024
Plus, a guide to using Apple's Image Playground in Buffer ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Breaking my Own Rules— The Bootstrapped Founder 364
Friday, December 20, 2024
Sometimes, you have to pivot. And that's harder than it seems: old assumptions are deeply ingrained, new frontiers look scary. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
10words: Top picks from this week
Friday, December 20, 2024
Today's projects: Killer Portfolio • Fugoya • No-Code Exits • OpenQR • AutoMemes.ai • Cuppa • Orange Pill Coffee • Startups Gallery • Mida.so • Based Labs AI • Samurai AI • Sideproject MVP 10words
🎁 The One Gift You Forgot (Hint: It's for You) ✨
Friday, December 20, 2024
MicroConf Hey Rob! You've probably checked off most names on your holiday gift list by now. 📝 But there's one person you might have overlooked – yourself. 🤔 While everyone else is unwrapping