Open Source Scored the First Major M&A of the Generative AI Era
Was this email forwarded to you? Sign up here Next Week in The Sequence:
📝 Editorial: Open Source Scored the First Major M&A of the Generative AI EraM&A activity is always interesting to evaluate the health of a tech market. While fundraising activity often forecasts the value of a company in the relatively long term, M&A activity provides a pragmatic view of what exit strategies might look like for a specific segment of companies. Having too much or too little M&A in a market is always bad; you want just the right level of deals to rationalize valuations in a sector. Well, last week, we witnessed the first high-profile M&A transaction in the generative AI space, and it went to the open-source column. Databricks agreed to acquire MosaicML for an astonishing $1.3 billion valuation. MosaicML is a two-year-old company behind the open-source MPT-30B and MPT-7B models, and it has built a state-of-the-art platform for training and fine-tuning foundation models. This deal is incredibly significant for several reasons. Firstly, it demonstrates the real potential of open-source foundation models as a viable alternative to closed, API-based models. I mean, to pay $1B+ for something, you must be truly convinced that these open-source models will match the quality of GPT-4, Claude, and PaLM. If you haven't tried MPT-30B, I think you will be pleasantly surprised by its tremendous quality. Secondly, Databricks' enterprise distribution can act as a strong catalyst for the adoption of MPT models and eliminate barriers for open-source generative AI. Lastly, paying $1.3 billion for a two-year-old company in a highly competitive space might seem irrational, but it shows that Databricks believes the MosaicML platform can unlock $10 billion to $20 billion in value. The MosaicML acquisition follows other significant transactions, such as Snowflake acquiring Streamlit for $800 million last year and Neeva for $150 million this year. Beyond the economics, I believe the Databricks-MosaicML deal is an incredible stamp of approval for open-source ML. Now we should see what Databricks' competitors (like Snowflake 😉 ) do. 🔎 ML ResearchCoDiMicrosoft Research published a paper detailing CoDi, a generative AI model capable of generating content across different modalities such as language, image, audio or video. Together with the paper, Microsoft announced Project i-Code to foment multimodal generative AI —> Read more. ZeRO++Microsoft Research published a paper detailing ZeRO++, a high performance communication pipeline optimized for LLM training. As it names indicates, ZeRO++ is built on top of ZeRO but reduces the communication volume by 4x —> Read more. A Unified Pretraining Strategy for Computer Vision ModelsGoogle Research published a paper unveiling a pretraining strategy that combines image captioning and image classification. The strategy delivers amazing performance in zero shot classification tasks —> Read more. XGenSalesforce Research open sourced XGen, a 7 billion parameter LLM trained on 8K sequence length for up to 1.5T tokens. XGen achieved amazing results in both language and coding tasks —> Read more. Textbooks is All You NeedIn a fascinating paper, Microsoft Research introduced phi-1, a transformer model for coding trained in high quality text book data. Despite having only 1.3B parameters, phi-1 to match the quality of larger alternatives —> Read more. 🤖 Cool AI Tech ReleasesLMFlowLMFlow is an open source toolkit for fine-tuning large foundation models —> Read more. Open LLM LeaderboardHugging Face provided an update about the helpful and controversial Open LLM Leaderboard —> Read more. Chat ArenaChat Arena is an open source game environment to enab,le research about autonomous LLM agents —> Read more. MediaPipe Diffusion PluginsGoogle Research open sourced text-to-image plugins for its MediaPipe on-device ML framework —> Read more. 🛠 Real World MLMeta AI CardsMeta AI released a series of cards that document the ML use cases across Facebook and Instagram —> Read more. Real Time ML at LyftLyft discusses the architecture behind Real-time Machine Learning with Streaming initiative which allow developers to incorporate real time ML capabilities into their applications —> Read more. Declarative Data Pipelines at LinkedInLinkedIn provided an overview of the architecture and tech powering their declarative data pipelines —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
💡Webinar: Designing & Scaling FanDuel's ML Platform—Best Practices & Lessons Learned
Friday, June 30, 2023
Discover FanDuel's journey in building a powerful ML platform for personalized experiences. Join the webinar on July 11 at 9 am PT to learn how they scaled their platform and implemented best
Edge 304: Inside AlphaDev: DeepMind’s Newest Breakthrough Model that Was Able to Discover New Computer Science Alg…
Thursday, June 29, 2023
Built on the foundation created by AlphaZero, the model discovered new and improved existing sorting algorithms.
The Sequence Chat: Daniel J. Mankowitz, DeepMind on Building AlphaDev to Discover New Computer Science Algorithms
Wednesday, June 28, 2023
One of the researchers behind DeepMind's groundbreaking model that discovered new sorting algorithms shares his insights about the experience.
Edge 303: The Top Two Types Retrieval-Augmented Language Models
Tuesday, June 27, 2023
What are the main types of techniques to augment LLMs with external information.
📝 Guest Post: Choosing the Right Vector Index For Your Project*
Monday, June 26, 2023
In this post, Frank Liu. ML Architect at Zilliz, discusses vector databases and different indexing strategies for approximate nearest neighbor search. The options mentioned include brute-force search,
You Might Also Like
Retro Recomendo: Gift Ideas
Sunday, November 24, 2024
Recomendo - issue #438 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Kotlin Weekly #434
Sunday, November 24, 2024
ISSUE #434 24th of November 2024 Hi Kotliners! Next week is the last one to send a paper proposal for the KotlinConf. We hope to see you there next year. Announcements State of Kotlin Scripting 2024
Weekend Reading — More time to write
Sunday, November 24, 2024
More Time to Write A fully functional clock that ticks backwards, giving you more time to write. Tech Stuff Martijn Faassen (FWIW I don't know how to use any debugger other than console.log) People
🕹️ Retro Consoles Worth Collecting While You Still Can — Is Last Year's Flagship Phone Worth Your Money?
Saturday, November 23, 2024
Also: Best Outdoor Smart Plugs, and More! How-To Geek Logo November 23, 2024 Did You Know After the "flair" that servers wore—buttons and other adornments—was made the butt of a joke in the
JSK Daily for Nov 23, 2024
Saturday, November 23, 2024
JSK Daily for Nov 23, 2024 View this email in your browser A community curated daily e-mail of JavaScript news React E-Commerce App for Digital Products: Part 4 (Creating the Home Page) This component
Not Ready For The Camera 📸
Saturday, November 23, 2024
What (and who) video-based social media leaves out. Here's a version for your browser. Hunting for the end of the long tail • November 23, 2024 Not Ready For The Camera Why hasn't video
Daily Coding Problem: Problem #1617 [Easy]
Saturday, November 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Microsoft. You are given an string representing the initial conditions of some dominoes.
Ranked | The Tallest and Shortest Countries, by Average Height 📏
Saturday, November 23, 2024
These two maps compare the world's tallest countries, and the world's shortest countries, by average height. View Online | Subscribe | Download Our App TIME IS RUNNING OUT There's just 3
⚙️ Your own Personal AI Agent, for Everything
Saturday, November 23, 2024
November 23, 2024 | Read Online Subscribe | Advertise Good Morning. Welcome to this special edition of The Deep View, brought to you in collaboration with Convergence. Imagine if you had a digital
Educational Byte: Are Privacy Coins Like Monero and Zcash Legal?
Saturday, November 23, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 23, 2024? The HackerNoon