The AlphaDev Milestone: A New Model that is Able to Discover and Improve Algorithms
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.

Next Week in The Sequence:
📝 Editorial: The AlphaDev Milestone: A New Model that is Able to Discover and Improve Algorithms

With all the hype around LLMs and foundation models, we sometimes overlook other areas of machine learning. Last week, DeepMind unveiled a major breakthrough that could mark an important step on the road to AGI.

What is a good test for artificial general intelligence (AGI)? This question has been central to the evolution of AI for decades. The Turing Test is the generic answer, but the real answer becomes more complicated once you delve into the specifics. Is there a single test that can indicate the emergence of AGI? Obviously, there are countless variations of the Turing Test that can be modeled. However, in my opinion, one category stands out as a leading indicator of AGI-like foundations: the discovery of new science and, more specifically, the discovery of new algorithms.

Designing algorithms requires a range of cognitive skills, such as multi-step reasoning, planning, and empirical evaluation. Many branches of mathematics, computer science, and physics rely on a set of core foundational algorithms. Despite the progress in computer science, breakthroughs in foundational algorithms have slowed significantly because of the high bar they must meet. Foundational problems like sorting, searching, matrix multiplication, and combinatorics have had established solutions for decades.

Can the new generation of AI models help improve some of the core algorithms that were used to create them? A few months ago, DeepMind unveiled AlphaTensor, a model that discovered more efficient matrix multiplication algorithms, an area that had not seen improvement in 50 years. Building on that work, DeepMind has now unveiled AlphaDev, a reinforcement learning (RL) model that discovered faster sorting algorithms and improved existing ones.
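To make the idea of searching for a sorting routine concrete, here is a minimal sketch in Python. It is not AlphaDev's method: it swaps reinforcement learning for plain exhaustive search, and it uses a hypothetical instruction set made up of compare-exchange steps on three values rather than real assembly instructions. All names (`INSTRUCTIONS`, `run`, `is_correct`, `shortest_sorting_program`) are invented for illustration.

```python
from itertools import product

# Hypothetical instruction set: compare-exchange on a pair of positions (i < j).
INSTRUCTIONS = [(0, 1), (0, 2), (1, 2)]


def run(program, values):
    """Apply each compare-exchange (i, j) in order and return the resulting list."""
    out = list(values)
    for i, j in program:
        if out[i] > out[j]:
            out[i], out[j] = out[j], out[i]
    return out


def is_correct(program, n=3):
    """Check the program on every 0/1 input of length n.

    For programs built only from compare-exchange steps, the zero-one
    principle says that sorting all 0/1 inputs implies sorting all inputs.
    """
    return all(
        run(program, bits) == sorted(bits)
        for bits in product((0, 1), repeat=n)
    )


def shortest_sorting_program(max_len=4):
    """Try every program from shortest to longest; return the first correct one."""
    for length in range(max_len + 1):
        for program in product(INSTRUCTIONS, repeat=length):
            if is_correct(program):
                return list(program)
    return None


best = shortest_sorting_program()
print(len(best), best)
print(run(best, [3, 1, 2]))
```

Exhaustive search only works here because the space is tiny: three possible instructions and programs of at most four steps. With a realistic instruction set and longer programs the number of candidates grows exponentially, which is why a learned search strategy becomes attractive.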
The model is based on AlphaZero, an RL model that achieved superhuman performance in games such as Go, chess, and shogi. Not surprisingly, the AlphaDev environment was modeled as a single-player game in which the model observes an algorithm and experiments with different instructions to improve it. AlphaDev not only discovered faster algorithms using existing methods but also found some based on completely novel approaches.

If you are a classically trained computer scientist, the idea of discovering new sorting algorithms may seem unfathomable. The core discussion about the path to AGI has centered on foundation models. Yet DeepMind is using RL to discover new algorithms. First matrix multiplication, and now sorting. Do you know what those two techniques are foundational to? That's right: AI.

🔎 ML Research

AlphaDev
DeepMind published a paper detailing AlphaDev, a new reinforcement learning method able to discover new algorithms. The model is based on AlphaZero and is trained on a single-player game in which the moves are candidate assembly instructions for the algorithm.

AVFormer
Google Research published a paper outlining AVFormer, a method for augmenting large-scale audio models with visual representations. The core principle is to inject visual embeddings into frozen ASR models to improve their robustness.

Visual Captions
Google Research published a paper discussing a technique that generates visuals based on real-time video conference streams. The model is fine-tuned on a dataset of visuals that are appropriate for video conference conversations.

3D Understanding
Salesforce Research published papers detailing ULIP and ULIP-2, two techniques used to understand 3D objects. Both techniques are based on multimodal methods that can process image, language, and 3D point cloud data.
ReLM
Researchers from Carnegie Mellon University published a paper introducing ReLM, a system that can query LLMs using regular expressions. ReLM is useful for evaluating LLM behaviors such as memorization, bias, and toxicity.

📍 Live Tutorial: Working with LLMs at Scale

This free event on June 15th explores LLMs and the two main problems they face in production: high cost and lack of domain knowledge. Discover how vector databases can help by using vector embeddings for data injection and caching. The virtual session ends with a hands-on tutorial where you can build an LLM application using LlamaIndex and Milvus.

🤖 Cool AI Tech Releases

PaLM in Vertex
Google announced support for its PaLM and PaLM 2 models in the Vertex platform.

Chat Notebooks
Stephen Wolfram published an incredible blog post about a new paradigm that combines LLMs and notebooks.

CodeTF
Salesforce Research open-sourced CodeTF, a Python library for code LLMs.

📡 AI Radar