Open Source Scored the First Major M&A of the Generative AI Era
Was this email forwarded to you? Sign up here Next Week in The Sequence:
📝 Editorial: Open Source Scored the First Major M&A of the Generative AI EraM&A activity is always interesting to evaluate the health of a tech market. While fundraising activity often forecasts the value of a company in the relatively long term, M&A activity provides a pragmatic view of what exit strategies might look like for a specific segment of companies. Having too much or too little M&A in a market is always bad; you want just the right level of deals to rationalize valuations in a sector. Well, last week, we witnessed the first high-profile M&A transaction in the generative AI space, and it went to the open-source column. Databricks agreed to acquire MosaicML for an astonishing $1.3 billion valuation. MosaicML is a two-year-old company behind the open-source MPT-30B and MPT-7B models, and it has built a state-of-the-art platform for training and fine-tuning foundation models. This deal is incredibly significant for several reasons. Firstly, it demonstrates the real potential of open-source foundation models as a viable alternative to closed, API-based models. I mean, to pay $1B+ for something, you must be truly convinced that these open-source models will match the quality of GPT-4, Claude, and PaLM. If you haven't tried MPT-30B, I think you will be pleasantly surprised by its tremendous quality. Secondly, Databricks' enterprise distribution can act as a strong catalyst for the adoption of MPT models and eliminate barriers for open-source generative AI. Lastly, paying $1.3 billion for a two-year-old company in a highly competitive space might seem irrational, but it shows that Databricks believes the MosaicML platform can unlock $10 billion to $20 billion in value. The MosaicML acquisition follows other significant transactions, such as Snowflake acquiring Streamlit for $800 million last year and Neeva for $150 million this year. Beyond the economics, I believe the Databricks-MosaicML deal is an incredible stamp of approval for open-source ML. Now we should see what Databricks' competitors (like Snowflake 😉 ) do. 🔎 ML ResearchCoDiMicrosoft Research published a paper detailing CoDi, a generative AI model capable of generating content across different modalities such as language, image, audio or video. Together with the paper, Microsoft announced Project i-Code to foment multimodal generative AI —> Read more. ZeRO++Microsoft Research published a paper detailing ZeRO++, a high performance communication pipeline optimized for LLM training. As it names indicates, ZeRO++ is built on top of ZeRO but reduces the communication volume by 4x —> Read more. A Unified Pretraining Strategy for Computer Vision ModelsGoogle Research published a paper unveiling a pretraining strategy that combines image captioning and image classification. The strategy delivers amazing performance in zero shot classification tasks —> Read more. XGenSalesforce Research open sourced XGen, a 7 billion parameter LLM trained on 8K sequence length for up to 1.5T tokens. XGen achieved amazing results in both language and coding tasks —> Read more. Textbooks is All You NeedIn a fascinating paper, Microsoft Research introduced phi-1, a transformer model for coding trained in high quality text book data. Despite having only 1.3B parameters, phi-1 to match the quality of larger alternatives —> Read more. 🤖 Cool AI Tech ReleasesLMFlowLMFlow is an open source toolkit for fine-tuning large foundation models —> Read more. Open LLM LeaderboardHugging Face provided an update about the helpful and controversial Open LLM Leaderboard —> Read more. Chat ArenaChat Arena is an open source game environment to enab,le research about autonomous LLM agents —> Read more. MediaPipe Diffusion PluginsGoogle Research open sourced text-to-image plugins for its MediaPipe on-device ML framework —> Read more. 🛠 Real World MLMeta AI CardsMeta AI released a series of cards that document the ML use cases across Facebook and Instagram —> Read more. Real Time ML at LyftLyft discusses the architecture behind Real-time Machine Learning with Streaming initiative which allow developers to incorporate real time ML capabilities into their applications —> Read more. Declarative Data Pipelines at LinkedInLinkedIn provided an overview of the architecture and tech powering their declarative data pipelines —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
💡Webinar: Designing & Scaling FanDuel's ML Platform—Best Practices & Lessons Learned
Friday, June 30, 2023
Discover FanDuel's journey in building a powerful ML platform for personalized experiences. Join the webinar on July 11 at 9 am PT to learn how they scaled their platform and implemented best
Edge 304: Inside AlphaDev: DeepMind’s Newest Breakthrough Model that Was Able to Discover New Computer Science Alg…
Thursday, June 29, 2023
Built on the foundation created by AlphaZero, the model discovered new and improved existing sorting algorithms.
The Sequence Chat: Daniel J. Mankowitz, DeepMind on Building AlphaDev to Discover New Computer Science Algorithms
Wednesday, June 28, 2023
One of the researchers behind DeepMind's groundbreaking model that discovered new sorting algorithms shares his insights about the experience.
Edge 303: The Top Two Types Retrieval-Augmented Language Models
Tuesday, June 27, 2023
What are the main types of techniques to augment LLMs with external information.
📝 Guest Post: Choosing the Right Vector Index For Your Project*
Monday, June 26, 2023
In this post, Frank Liu. ML Architect at Zilliz, discusses vector databases and different indexing strategies for approximate nearest neighbor search. The options mentioned include brute-force search,
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your