The AlphaDev Milestone: A New Model that is Able to Discover and Improve Algorithms
Was this email forwarded to you? Sign up here The AlphaDev Milestone: A New Model that is Able to Discover and Improve AlgorithmsSundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.Next Week in The Sequence:
📝 Editorial: The AlphaDev Milestone: A New Model that is Able to Discover and Improve AlgorithmsWith all the hype around LLMs and foundation models, sometimes we ignore other areas of machine learning. Last week, DeepMind unveiled a major breakthrough in AI that could signal an important element of the road to AGI. What is a good test for Artificial General Intelligence (AGI)? This question has been central to the evolution of AI for decades. While the Turing Test is a generic answer, the real answer becomes more complicated when delving into the specifics. Is there a single test that can indicate the emergence of AGI? Obviously, there are countless variations of the Turing Test that can be modeled. However, in my opinion, there is a category that stands out as a leading indicator of AGI-like foundations—the discovery of new science and, more specifically, the discovery of new algorithms. Algorithm modeling requires various cognitive skills, such as multi-step reasoning, planning, and empirical evaluation, among others. Many branches of mathematics, computer science, or physics rely on a set of core foundational algorithms. Despite the progress in computer science, breakthroughs in foundational algorithms have significantly slowed down due to the high bar they must meet. Foundational problems like sorting, searching, matrix multiplication, or combinatorics have had established solutions for decades. Can the new generation of foundation models help improve some of the core algorithms that were used to create them? A few months ago, DeepMind unveiled AlphaTensor, a new model that helps discover a more efficient matrix multiplication algorithm that hasn't seen improvement in 50 years. Building on that work, DeepMind has now unveiled AlphaDev, a reinforcement learning (RL) model that discovered faster sorting algorithms and improved existing ones. The model is based on AlphaZero, an RL model that achieved superhuman performance in various games like Go, chess, and shogi. Not surprisingly, the AlphaDev environment was modeled as a single-player game in which the model observes an algorithm and experiments with different instructions to improve it. AlphaDev was not only able to discover faster algorithms using existing methods but also some based on completely novel approaches. If you are a classically trained computer scientist, the idea of discovering new sorting algorithms may seem unfathomable. The core discussion about the path to AGI has been centered around foundation models. Yet, DeepMind is using RL to discover new algorithms. First matrix multiplication, and now sorting. Do you know what those two techniques are foundational to? That's right: AI. 🔎 ML ResearchAlphaDevDeepMind published a paper detailing AlphaDev, a new reinforcement learning method able to discover new algorithms. The model was based on AlphaZero and trained on a single-player assembly game based on potential instructions of the algorithm —> Read more. AVFormerGoogle Research published a paper outlining AVFormer, a method for augmenting large scale audio models with visual representations. The core principle is based on injecting visual embeddings into frozen ASR models to improve their robustness —> Read more. Visual CaptionsGoogle Research published a paper discussing a technique that generates visuals based on real time video conference streams. The model is fine tuned using a dataset of visuals that are appropriate for video conference conversations —> Read more. 3D UnderstandingSalesforce Research published papers detailing ULIP and ULIP-2, two techniques used to understand 3D objects. Both technique are based on multimodal methods that can process image, language and 3D cloud data —> Read more. ReLMResearchers from Carnegie Mellon University published a paper introducing ReLM, a model that can query LLMs using regular expressions. ReLM is relevant for tasks such as validating aspects such as memorization, bias or toxicity in LLMs —> Read more. 📍 Live Tutorial: Working with LLMs at ScaleThis free event on June 15th explores LLMs and the two main problems they face when it comes to production: high cost and lack of domain knowledge. Discover how vector databases can be a solution by facilitating data injection and caching through the use of vector embeddings. The virtual session ends with a hands-on tutorial where you can build an LLM application using LlamaIndex and Milvus. 🤖 Cool AI Tech ReleasesPaLM in VertexGoogle announced support for its PaLM and PaLM2 modes in the Vertex platform —> Read more. Chat NotebooksStephen Wolfram published an incredible blog post about a new paradigm that combines LLMs and notebooks —> Read more. CodeTFSalesforce Research open source CodeTF, a Python library for code LLMs —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
📺 See how programmatic labeling is the key to using LLMs [Live Demo]
Monday, June 5, 2023
Even with the rapid advancements to AI made possible by LLMs and Foundation Models, data remains the key to unlocking real value for enterprise AI. Join us at this live demo, where Snorkel AI co-
The Next RLHF Effect: Three Breakhroughts that can Unlock the Next Wave of Innovation in Foundation Models
Sunday, June 4, 2023
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
This week on TuringPost
Saturday, June 3, 2023
Hi there, Last week, we introduced Turing Post – the newsletter that complements TheSequence and covers other topics that help you make informed decisions about AI. It is for those who are in the AI
📝 Guest Post: Stop Hallucinations From Hurting your LLM Powered Apps*
Friday, June 2, 2023
LLM hallucinations pose a big threat to the successful adoption of the new wave of LLM apps. In this post, the Galileo team dives into how one can prevent hallucinations from creeping in, as well as
Edge 296: Inside OpenAI's Method to Use GPT-4 to Explain Neuron's Behaviors in GPT-2
Thursday, June 1, 2023
The technique is one of the first attempts to utilize LLMs as a explainability foundation.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your