Four New Major Open Source Foundation Models in a Week
DBRX, Grok 1.5, Samba-CoE and Jamba are all bringing unique innovations to open source generative AI.
📝 Editorial: Four New Major Open Source Foundation Models in a Week

Open source generative AI is experiencing tremendous momentum, and last week was a major example of this with the release of four major foundation models. By open source, we refer to the weights of the models and not the training datasets or processes. At this time, it's fair to say that the model weights are where most companies draw the line between open source and closed source. Many purists do not consider this true open source, but in a field evolving as rapidly as generative AI, preserving a level of competitive advantage is essential for any company. Let’s just say that the nature of open source is being reimagined for generative AI. The fast pace of generative AI also makes the open source race even more fascinating. Last week, we witnessed the release of four major open source models, each innovative in its own way: DBRX, Grok 1.5, Samba-CoE and Jamba.
Regardless of where you fall in the commercial vs. open source debate in generative AI, it is undeniable that the latter will play a major role in the mainstream adoption of this technology. This week shows how strong the momentum in open source generative AI is.

With just one week left until apply() ’24, the premier virtual conference for engineers mastering AI and ML, we wanted to remind you to secure your spot before it's too late!

Date: Wednesday, April 3 / 9:00AM – 5:00PM PT / Virtual

At apply(), our goal is to provide you with the tools and insights you need to conquer AI and ML challenges at production scale. With speakers from LangChain, Meta, Pinterest, Vanguard, Visa, Samsung, NextDoor, and many more in the lineup, this year's event promises to be our best yet. Be sure to join live for the chance to win swag or a giveaway prize!

🔎 ML Research

Can LLMs Explore?
Researchers from Microsoft and Carnegie Mellon University published a paper examining whether LLMs can engage in exploration, an ability typically reserved for reinforcement learning models. The research describes environments such as multi-armed bandits in prompts and determines whether LLMs can explore the environment in order to take actions.

Tnt-LLM
Microsoft Research published a paper introducing Tnt-LLM, an LLM framework that generates and predicts task labels with minimal user involvement. Tnt-LLM is actively used to discover user intent in Microsoft Copilot.

AutoBNN
Google Research published research on, and open sourced, AutoBNN, a JAX framework for interpretable time series forecasting models. AutoBNN’s core idea is to combine the interpretability of traditional time series models with the scalability of neural networks in a single architecture.

SaLEM
Amazon Science published a paper introducing SaLEM (for salient-layers editing model), a method for editing layers in an LLM. SaLEM’s key contribution is that it selects the layers to be edited automatically.

SAFE
Google DeepMind published a paper presenting Search-Augmented Factuality Evaluator (SAFE), a method for factual evaluation in LLMs using synthetic data. SAFE breaks down a long LLM response into individual facts and evaluates the accuracy of each one.

🤖 Cool AI Tech Releases

DBRX
Databricks released DBRX, a new state-of-the-art open source LLM.

Jamba
AI21 Labs open sourced Jamba, a new model that augments a structured state space model (SSM) with elements of the transformer architecture.

Samba CoE v0.2
SambaNova previewed the performance of Samba CoE v0.2, a new version of Samba-1 that scored highly across many benchmarks.

Grok 1.5
X.ai released Grok 1.5 with improved reasoning capabilities and a longer context length.

Voice Engine
OpenAI published some details about Voice Engine, a new model for creating custom voices.

🛠 Real World ML

Video Content Moderation at Yelp
Yelp discusses the ML architecture powering its video content moderation solution.