The Most Amazing Week in Gen AI Releases
Was this email forwarded to you? Sign up here The Most Amazing Week in Gen AI ReleasesOpenAI, Google, Microsoft, Cohere and others shipped new models.Next Week in The Sequence
You can subscribe to The Sequence below:📝 Editorial: Models, Models, Models: The Most Amazing Week in Gen AI ReleasesAs we are approaching the holidays it seems that every major AI lab decided to release their latest models. Without a doubt, last week has to be one of the most impressive weeks in the history of generative AI in terms of model releases with Microsoft, OpenAI , Google, Cohere and others shipping new models. Take a look:
Even by generative AI standards, last week classifies as impressive. The level of innovation in this market is something the tech industry hasn’t seen since the personal computer revolution. Quite remarkable. 🔎 ML ResearchPhi-4In "Phi-4 Technical Report", researchers from Microsoft Research developed a 14-billion parameter language model named phi-4. Phi-4 is trained using a data-centric approach that prioritizes data quality, incorporating synthetic data generated through multi-step prompting workflows and curated high-quality organic data —> Read more. Bag of NuggetsIn "BoN Jailbreaking: Bypassing Safety Measures in Multi-Modal LLMs", researchers from Anthropic, Speechmatics, MATS, UCL, Stanford University, University of Oxford, Tangentic, and others present a new method called "Bag of Nuggets" (BoN). BoN jailbreaking involves generating a large number of augmented inputs, typically 10,000, by applying various perturbations to a harmful request, such as character scrambling, random capitalization, and character noising —> Read more. Video Creation by DemonstrationIn "Video Creation by Demonstration", researchers from Google DeepMind introduce a novel video generation task and a corresponding method called 𝛿-Diffusion. 𝛿-Diffusion allows users to create videos that continue from a given context image while incorporating action concepts from a demonstration video, enabling creative control and realistic video synthesis —> Read more. JuStRankIn "JuStRank: Benchmarking LLM Judges for System Ranking", researchers from IBM Research introduce JuStRank, the first large-scale benchmark for evaluating LLM judges for ranking target systems. The study examines various LLM judges and aggregation methods, comparing their system rankings to a human-based ranking, providing insights into judge behavior and bias —> Read more. ScribeAgentIn "ScribeAgent: Fine-Tuning Open-Source LLMs for Enhanced Web Navigation", researchers from Carnegie Mellon University present ScribeAgent, an approach that leverages fine-tuning of open-source LLMs on a large dataset of real-world web workflows. By fine-tuning Qwen models on a massive dataset of user-annotated web workflows, ScribeAgent surpasses GPT-4-based agents on various web navigation benchmarks —> Read more. Meta’s New ResearchMeta AI published a detailed list of recent research in terms of agentic workflows, and nnew architectures. The research includes Meta Motivo for controlling embodied agent behviors and Meta Video for watermarking —> Read more. 🤖 AI Tech ReleasesGemini 2.0Google unveiled Gemini 2.0, its new model for agentic workflows —> Read more. SoraOpenAI released the first version of Sora, its highly anticipated text-to-video model —> Read more. Command R7BCohere released Command R7B, a small LLM focused on enterprise AI apps —> Read more. Fast-LLMServiceNow open sourced the framework Fast-LLM to streamline pretraining of foundation models —> Read more. Pika 2.0Pika announced its 2.0 model with quite a bit of new features —> Read more. 🛠 Real World AIInside AgentforceSalesforce discusses the reasoning engine behind its Agentforce platform —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
📽 Webinar: How To Maximize Model Accuracy
Thursday, December 19, 2024
Struggling to keep your production ML models accurate without an endless budget? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 457: Can we Distill Specific Knowledge in LLMs? An Intro to Attention-Based Distillation
Thursday, December 19, 2024
One of the most interesting distillation techniques for foundation models. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Sequence Chat: Can AI Solve The Riemann Hypothesis? Some Ideas About the Progress and Limitations of AI in Sci…
Thursday, December 19, 2024
AI has proven that can help advance scientific fields but how far can that go and what are the pragmatic limitations? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Sequence Chat: The One Area in Which China can Dominate the US in the AI Race
Wednesday, December 11, 2024
Might come as a surprise. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 455: Building Smaller Foundation Models Using Graph-Based Distillation
Tuesday, December 10, 2024
Diving into one of the most sophisticated distillation methods in the gen AI space. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your