Open Source Models : What Can We Determine from Download Patterns?
Tomasz Tunguz, Venture Capitalist
Open source models have become a critical part of the AI landscape. I was curious about the trends in the open source ecosystem, so I analyzed HuggingFace data on the top 300 open source models, both by overall usage & by position on the trending list.

Open source models are governed by open source licenses. As with regular open source software, Apache & MIT dominate the licenses by model count: 76% of the top models choose one of these licenses, and Apache is nearly twice as popular as MIT. The concentration is even greater when measured by downloads. Models with Apache or MIT licenses account for 92% of downloads last month.

Stability, Meta, & Microsoft top the creator list of open source models by count. So does TheBloke, an engineer who quantizes (or compresses) open source models. But the download data shows very different patterns. Meta’s models recorded 30% of downloads, driven by its wav2vec model for speech recognition. OpenAI & Google follow not far behind.

The most popular models by downloads are Fill-Mask models, which are models for training other models. Speech recognition is second. Third is text classification (LLMs are very good at this). Text generation is fifth.

How about popularity? HuggingFace likes of a model are almost entirely uncorrelated with downloads, with an R^2 of 0.06.

Overall, we can conclude that more permissive licenses dominate the top models. Meta, Google, Microsoft, Stability, & OpenAI are important players within the open source ecosystem. Speech is the most popular end-user application of open source models by downloads in the last month, surpassed only by models used for training & testing, which makes sense given how many companies are building or testing LLMs.

Given all the innovation in the space, in a quarter or two, this data might be very different. Who do you think will top the charts at the end of 2024?
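
For readers who want to try this kind of analysis themselves, here is a minimal sketch of the three calculations discussed above: license share by model count, license share by downloads, & the R^2 between likes and downloads. It is not the code behind the figures in this post. The rows in `MODELS` are made-up placeholder values, and the function names are hypothetical, chosen only for illustration. Real inputs would come from a HuggingFace export of the top models.

```python
from collections import defaultdict

# Hypothetical sample rows: (license, downloads, likes).
# These are placeholder values, not real HuggingFace data.
MODELS = [
    ("apache-2.0", 1000, 10),
    ("apache-2.0", 500, 40),
    ("mit", 300, 5),
    ("other", 200, 45),
]


def share_by_count(models):
    """Fraction of models under each license."""
    counts = defaultdict(int)
    for license_name, _downloads, _likes in models:
        counts[license_name] += 1
    total = len(models)
    return {name: n / total for name, n in counts.items()}


def share_by_downloads(models):
    """Fraction of total downloads under each license."""
    downloads = defaultdict(int)
    for license_name, n_downloads, _likes in models:
        downloads[license_name] += n_downloads
    total = sum(downloads.values())
    return {name: n / total for name, n in downloads.items()}


def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series has no linear relationship to measure
    return (cov * cov) / (var_x * var_y)


if __name__ == "__main__":
    print(share_by_count(MODELS))
    print(share_by_downloads(MODELS))
    likes = [row[2] for row in MODELS]
    downloads = [row[1] for row in MODELS]
    print(r_squared(likes, downloads))
```

Comparing the two share functions on the same rows shows how a license can look modest by model count & still dominate by downloads, which is the pattern described for Apache & MIT. An R^2 near zero, like the 0.06 reported above, means likes explain almost none of the variation in downloads.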