TheSequence - More Super Models is All We Need
📝 Editorial: More Super Models is All We Need

The release of new foundation models is nothing new in the fast-moving generative AI market. Yet last week felt overwhelming. I sat down to write this editorial on Friday morning, afraid that I might have missed some new announcement at the end of the week. That fear came from living through one of the most impressive weeks in the history of generative AI. In just a few days, we witnessed the announcements of five mega generative AI models by some of the major players in the space. Even more impressive, each of these releases pushes a specific line of innovation within generative AI rather than merely copying the others. Let's do a quick recap to put things in context.
How's that for a single week? These releases are likely to play a significant role in the next generation of generative AI applications, and they are also championing new and unique innovations in the space. Keep the supermodels coming!

📌 Mastering AI and ML at Production Scale at the apply() Virtual Conference

Join the next apply() virtual conference on Wednesday, April 3, for a free event that brings together the engineering community to master AI and ML in production. Since 2021, apply() has hosted more than 24,000 people with a single purpose: helping people advance their skills and expertise in AI/ML. Experienced engineers and visionaries in the industry will share best practices and actionable guidance for transitioning from experimental models to highly scalable applications. In the past, Databricks CEO Ali Ghodsi and Min Cai from Uber shared invaluable insights, covering everything from LLMs to best practices for building scalable machine learning platforms, and there's even more planned for April!

🔎 ML Research

V-JEPA
Meta AI published a paper and source code detailing Video Joint Embedding Predictive Architecture (V-JEPA), another model in line with their self-supervised learning vision. V-JEPA learns by predicting missing parts of videos in an abstract representation space.

More Agents is All You Need
Tencent AI Research published an interesting paper proposing a sampling-and-voting method to enhance the performance of LLMs. The technique seems to scale with the number of agents instantiated, and its gains appear to grow with the complexity of the task.

MGIE
Researchers from Apple and UC Santa Barbara published a paper detailing MLLM-Guided Image Editing (MGIE), an instruction-based image editing model. MGIE takes expressive instructions as input and derives explicit guidance from them.
MoEs and Scaling Laws
Researchers from Google DeepMind and several universities published a paper with insights into the scaling laws of mixture-of-experts (MoE) architectures. The core contribution of the paper is showing that MoE architectures yield models that scale better with parameter count.

GraphRAG
Microsoft Research published details about GraphRAG, a technique that builds knowledge graphs over private datasets using the contextual knowledge of LLMs. GraphRAG improves over traditional RAG techniques when operating on complex private datasets.

🤖 Cool AI Tech Releases

Sora
OpenAI unveiled a preview of Sora, an astonishing video generation model.

Aya
Cohere open sourced Aya, an instruction fine-tuned, multilingual LLM with support for over 100 languages.

Gemini 1.5
Google unveiled the next version of Gemini just a week after its prior release.

Chat with RTX
NVIDIA launched Chat with RTX, a demo that runs an LLM agent on a local Windows PC and personalizes it with the data stored on that machine.

Stable Cascade
Stability AI open sourced Stable Cascade, a new text-to-image model that is optimized and easier to fine-tune.

ChatGPT Memory
OpenAI announced new memory capabilities for ChatGPT.

LangSmith
LangChain announced the general availability of LangSmith, its tool for LLM testing and monitoring.

🛠 Real World ML

FlyteInteractive
LinkedIn discusses details about FlyteInteractive, a tool for debugging and interacting with AI models deployed in Kubernetes pods.