Maybe Two Big Research Breakthroughs or Maybe Nothing
Was this email forwarded to you? Sign up here Maybe Two Big Research Breakthroughs or Maybe NothingMulti-token prediction and a multi-layer perceptron alternative.Next Week in The Sequence:
You can subscribed to The Sequence below:📝 Editorial: Maybe Two Big Research Breakthroughs or Maybe NothingResearch breakthroughs always command a lot of attention in the generative AI space, as they can drive a new wave of innovation for foundation models. However, most of the so-called breakthroughs in the research sphere rarely make it into real implementations, so it's hard to determine whether a new breakthrough will really be influential or not. Last week was particularly interesting because we saw two papers that, on the surface, seem to be quite a big deal for generative AI models, but it's quite hard to determine if they are ready for prime-time. 1. Multi-Token Predictions In a paper titled "Better & Faster Large Language Models via Multi-token Prediction," several AI labs, including Meta AI, proposed a method for multi-token prediction in LLMs. It is a known fact that LLMs work by predicting one token at a time, so this seems like a big leap forward. In theory, the method works by predicting 'n' tokens using 'n' independent output heads. One of the tests with a 13B parameter model seems to solve 12% more problems on the HumalEval dataset than next-token model equivalents. One immediate limitation is that this only seems to be effective in very large models but still seems quite an improvement. 2. MLP Alternative The multi-layer perceptron (MLP) is, arguably, the most important algorithm in the history of deep learning. In a paper titled "KAN: Kolmogorov-Arnold Networks (KANs)," researchers from MIT proposed an audacious alternative to MLP. The main innovation relies on using learnable activation functions on weights instead of the fixed activation functions on nodes in MLP. While conceptually sound, KANs have only been evaluated in a few targeted use cases. Both the multi-token prediction and the KAN paper are clear indicators of the amazing speed of progress in AI research. Both papers are trying to disrupt established techniques. They both seem to be relevant breakthroughs on the surface. Or maybe not 😊" 🔥 Announcing Galileo Protect: Real-Time Hallucination FirewallCan you stop hallucinations in real-time? We’re excited to support Galileo Protect – an advanced GenAI firewall that intercepts hallucinations, prompt attacks, security threats, and more! See Galileo Protect live in action and learn: 🔎 ML ResearchLLM Juries vs. JudgesCohere research published a paper proposing a technique to evaluate LLMs using a panel of diverse models. The method looks to mitigate the bias of single-model evaluation techniques and shows a strong performance across different evaluation benchmarks —> Read more. Iterative ReasoningResearchers from MetaAI and New York University published a paper outlining an iteractive reasoning technique for LLMs. The essence of the method is to optimize across different chain-of-thought(CoT) options to lead to the correct answer —> Read more. Gemini in MedicineGoogle DeepMind published a paper introducing Med-Gemini, a multimodal version of Gemini specialized in medical tasks. Med-Gemini was evaluated across 14 medical benchmarks outperforming GPT-4 in several of them —> Read more. Multi-Token PredictionResearchers from Meta AI, CERMICS Ecole des Ponts Paris Tech and LISN Université Paris-Saclay published a paper proposing an intriguing technique for multi-token prediction in LLMs. At each position of the training corpus, the method can predict a number of future tokens —> Read more. MLP PredictionResearchers from Massachusetts Institute of Technology, California Institute of Technology and Northeastern University published a paper introducing e KolmogorovArnold Networks (KANs), promising alternatives to Multi-Layer Perceptrons (MLPs). The core idea is to switch from the MLP’s fixed actiavation functions on neurons to learnable activation functions on weights —> Read more. Math Benchmark ContaminationResearchers from Scale AI created a new benchmark for elementary math reasoning similar to GSM8k. The results show that several top models such as Mistral and Phi might be overfitting for existing math benchmarks while others such as Gemini, GPT-4 and Claude aren’t —> Read more. 🤖 Cool AI Tech ReleasesXTunerXtuner is a new toolkit for fine-tuning LLMs that is gaining rapid traction —> Read more. Amazon QAmazon Q, its Copilot alternative, hit general availability —> Read more. Claude AppAnthropic released a new iOS App for Claude as well as a new teams subscription plan. Jamba Instruct AI21 released Jamba Instruct, an instruction fine-tuned version of their Jamba model that combines SSMs and transformers —> Read more. 🛠 Real World MLAirbnb BrandometerAirbnb discusses their AI language techniques to understand brand perception in social media channels —> Read more. Amazon AI-Enhanced Catalogue DataAmazon dives into the ML techniques used to evaluate the effectiveness of AI-enhaced catalogue data —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Edge 392: Meet RAFT: UC Berkeley's New Method to Improve RAG Patterns in LLMs
Thursday, May 2, 2024
The method brings the best of RAG and supervised fine tuning. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 391: Autonomous Agents and LLM Function Calling
Tuesday, April 30, 2024
LLMs that invoke external functions, UC Berkeley's LLM Compiler and the Phidata framework. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Nobody Likes a Know-It-All: Smaller LLMs are Gaining Momentum
Sunday, April 28, 2024
Phi-3 and OpenELM, two major small model releases this week. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 390: Diving Into Databricks' DBRX: One of the Most Impressive Open Source LLMs Released Recently
Thursday, April 25, 2024
The model uses an MoE architecture which exhibits remarkable perfromance on a relatively small budget. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 389: Understanding Large Action Models
Tuesday, April 23, 2024
One of the most important concepts in autonomous agents. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your