Tomasz Tunguz - The Premise of a New S-Curve in AI
Tomasz TunguzVenture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Premise of a New S-Curve in AI
Since July, have you noticed how much better your AI model has become? Measuring them is hard to do. All we can do is quantify the vibe : is this one better than that one? Elo is a score that measures how often one model wins against another, as judged by a human. Which model answers the prompt : “Describe the differences in texture between a Pink Lady and a Macoun apple” better? The one with the higher Elo score.1 In the last four months, the top 100 models have improved their Elo by about 60 points, with the top models now at 1339 vs 1287 in July. The biggest performance gains occurred at the center part of the distribution. Researchers have driven significantly more performance with innovations in algorithms.
The smallest models have increased performance most. October models have increased their win rates by nearly a third in four months. All of the models have improved their competitive win rates by more than 20%. In July, we posed the question : what happens when model performance asymptotes? Progress in small, medium, & large models is linear in Elo-terms. But the mega models show more data points of inflection, suggesting the recent innovations in reasoning & scale (the biggest models have grown from 200b parameters to more than 400b) have produced the beginning of a new high-growth S-curve. 1 See the Bradley-Terry model. |
Older messages
How M&A Fosters Innovation
Tuesday, October 8, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. How M&A Fosters Innovation Recently, Thomas Laffont of
Fulfilling Crypto's Original Promise
Thursday, October 3, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Fulfilling Crypto's Original Promise Visa announced their
Where is the Budget for AI Coming From?
Tuesday, October 1, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Where is the Budget for AI Coming From? Morgan Stanley
Would You Listen to AI Generated Podcasts?
Monday, September 30, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Would You Listen to AI Generated Podcasts? Recently, Google
Interwoven with Initia
Wednesday, September 25, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Interwoven with Initia Like in web2, building an app on a
You Might Also Like
Startups bolstering the new grid
Tuesday, October 22, 2024
Plus: Cazoo backer DMG Ventures announces new funds; latest deals View in browser Sponsor Card - Flagship-34 Good morning there, Earlier in the year Sifted's Freya Pratty and Mimi Billing wrote
✨ How AI Agents Can Build Your Whole App
Monday, October 21, 2024
With rapid advancements in LLMs, AI can now follow prompts to generate code and build functional custom software. This Week at YC October 21st, 2024 ✨ The Latest How AI Agents Can Build Your Whole App
◼️ Don’t invest in dark mode
Monday, October 21, 2024
Dark mode is overhyped! ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
16 Silicon Valley Startups Raised $644.8 Million - Week of October 21, 2024
Monday, October 21, 2024
🇯🇵 Trends from Japan 💸 How to Recruit Scale Engineering Teams 💰 Johor New Data Hub of SE Asia 💰 20 Min Scored $400M 🎉 Vectara's HHEM Surpassed 2M Downloads 😮 Teenager Legendary Hacker ͏ ͏ ͏ ͏ ͏ ͏ ͏
🦄 ChatGPT for Workplace Productivity
Monday, October 21, 2024
Findr uses AI to easily find info on your device.
the world’s most promising fintech companies
Monday, October 21, 2024
unpacking the Fintech 100 + Q&A with the analyst behind the list Hi there, We're getting ready to unveil the 2024 Fintech 100. Selected for their tech novelty and market potential, these
AI Prompts as PRDs : Why Prompts Will Become Important IP Assets
Monday, October 21, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. AI Prompts as PRDs : Why Prompts Will Become Important IP
Tomorrow: Ask me Anything with Darcy Lorincz
Monday, October 21, 2024
There is Still Time To RSVP!
[Inverted Passion] Getting things done by not trying
Monday, October 21, 2024
Here's a new post on InvertedPassion.com Getting things done by not trying By Paras Chopra on Oct 20, 2024 09:29 am I recently finished a very short book with an intriguing title: Why Greatness
Family offices on de-fence
Monday, October 21, 2024
Plus: How to raise a Series A; latest deals View in browser Sponsor Card - Up Round-27 Good morning there, My colleague Anne Sraders has spent the last six months reporting on the growing number of VCs