The AlphaDev Milestone: A New Model that is Able to Discover and Improve Algorithms
Was this email forwarded to you? Sign up here The AlphaDev Milestone: A New Model that is Able to Discover and Improve AlgorithmsSundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.Next Week in The Sequence:
📝 Editorial: The AlphaDev Milestone: A New Model that is Able to Discover and Improve AlgorithmsWith all the hype around LLMs and foundation models, sometimes we ignore other areas of machine learning. Last week, DeepMind unveiled a major breakthrough in AI that could signal an important element of the road to AGI. What is a good test for Artificial General Intelligence (AGI)? This question has been central to the evolution of AI for decades. While the Turing Test is a generic answer, the real answer becomes more complicated when delving into the specifics. Is there a single test that can indicate the emergence of AGI? Obviously, there are countless variations of the Turing Test that can be modeled. However, in my opinion, there is a category that stands out as a leading indicator of AGI-like foundations—the discovery of new science and, more specifically, the discovery of new algorithms. Algorithm modeling requires various cognitive skills, such as multi-step reasoning, planning, and empirical evaluation, among others. Many branches of mathematics, computer science, or physics rely on a set of core foundational algorithms. Despite the progress in computer science, breakthroughs in foundational algorithms have significantly slowed down due to the high bar they must meet. Foundational problems like sorting, searching, matrix multiplication, or combinatorics have had established solutions for decades. Can the new generation of foundation models help improve some of the core algorithms that were used to create them? A few months ago, DeepMind unveiled AlphaTensor, a new model that helps discover a more efficient matrix multiplication algorithm that hasn't seen improvement in 50 years. Building on that work, DeepMind has now unveiled AlphaDev, a reinforcement learning (RL) model that discovered faster sorting algorithms and improved existing ones. The model is based on AlphaZero, an RL model that achieved superhuman performance in various games like Go, chess, and shogi. Not surprisingly, the AlphaDev environment was modeled as a single-player game in which the model observes an algorithm and experiments with different instructions to improve it. AlphaDev was not only able to discover faster algorithms using existing methods but also some based on completely novel approaches. If you are a classically trained computer scientist, the idea of discovering new sorting algorithms may seem unfathomable. The core discussion about the path to AGI has been centered around foundation models. Yet, DeepMind is using RL to discover new algorithms. First matrix multiplication, and now sorting. Do you know what those two techniques are foundational to? That's right: AI. 🔎 ML ResearchAlphaDevDeepMind published a paper detailing AlphaDev, a new reinforcement learning method able to discover new algorithms. The model was based on AlphaZero and trained on a single-player assembly game based on potential instructions of the algorithm —> Read more. AVFormerGoogle Research published a paper outlining AVFormer, a method for augmenting large scale audio models with visual representations. The core principle is based on injecting visual embeddings into frozen ASR models to improve their robustness —> Read more. Visual CaptionsGoogle Research published a paper discussing a technique that generates visuals based on real time video conference streams. The model is fine tuned using a dataset of visuals that are appropriate for video conference conversations —> Read more. 3D UnderstandingSalesforce Research published papers detailing ULIP and ULIP-2, two techniques used to understand 3D objects. Both technique are based on multimodal methods that can process image, language and 3D cloud data —> Read more. ReLMResearchers from Carnegie Mellon University published a paper introducing ReLM, a model that can query LLMs using regular expressions. ReLM is relevant for tasks such as validating aspects such as memorization, bias or toxicity in LLMs —> Read more. 📍 Live Tutorial: Working with LLMs at ScaleThis free event on June 15th explores LLMs and the two main problems they face when it comes to production: high cost and lack of domain knowledge. Discover how vector databases can be a solution by facilitating data injection and caching through the use of vector embeddings. The virtual session ends with a hands-on tutorial where you can build an LLM application using LlamaIndex and Milvus. 🤖 Cool AI Tech ReleasesPaLM in VertexGoogle announced support for its PaLM and PaLM2 modes in the Vertex platform —> Read more. Chat NotebooksStephen Wolfram published an incredible blog post about a new paradigm that combines LLMs and notebooks —> Read more. CodeTFSalesforce Research open source CodeTF, a Python library for code LLMs —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
📺 See how programmatic labeling is the key to using LLMs [Live Demo]
Monday, June 5, 2023
Even with the rapid advancements to AI made possible by LLMs and Foundation Models, data remains the key to unlocking real value for enterprise AI. Join us at this live demo, where Snorkel AI co-
The Next RLHF Effect: Three Breakhroughts that can Unlock the Next Wave of Innovation in Foundation Models
Sunday, June 4, 2023
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
This week on TuringPost
Saturday, June 3, 2023
Hi there, Last week, we introduced Turing Post – the newsletter that complements TheSequence and covers other topics that help you make informed decisions about AI. It is for those who are in the AI
📝 Guest Post: Stop Hallucinations From Hurting your LLM Powered Apps*
Friday, June 2, 2023
LLM hallucinations pose a big threat to the successful adoption of the new wave of LLM apps. In this post, the Galileo team dives into how one can prevent hallucinations from creeping in, as well as
Edge 296: Inside OpenAI's Method to Use GPT-4 to Explain Neuron's Behaviors in GPT-2
Thursday, June 1, 2023
The technique is one of the first attempts to utilize LLMs as a explainability foundation.
You Might Also Like
Our verdict on the new iPad Pro
Tuesday, May 14, 2024
The Morning After It's Tuesday, May 14, 2024. Apple's new iPad Pro is one of the most divisive (and thinnest) devices the company has made in years. Sure, it's an undeniable feat of
New Cross-Platform Android, iOS Feature Detects Unwanted Bluetooth Tracking Devices
Tuesday, May 14, 2024
THN Daily Updates Newsletter cover Enterprise Transformation to AI and the Metaverse ($59.99 Value) FREE for a Limited Time Strategies for the Technology Revolution Download Now Sponsored LATEST NEWS
Post from Syncfusion Blogs on 05/14/2024
Tuesday, May 14, 2024
New blogs from Syncfusion What is Cybersecurity? By Katherine Dobson This blog post explores simple cybersecurity practices to safeguard your data in today's digital world. Reached 50! A Milestone
Zugu — Always Forward.
Tuesday, May 14, 2024
The last iPad case you need. See the most loved features you can't live without. The form and style of ZUGU cases have evolved naturally, resulting from designing products that safeguard your
Edge 395: Task Decomposition in Autonomous Agents
Tuesday, May 14, 2024
The cornerstone of planning in autonomous agents. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
GPT-4o crushes leaderboard 🧠, cheap humanoid robots 🤖, GameStop 2.0 📈
Tuesday, May 14, 2024
OpenAI announced a new AI model yesterday called GPT-4o that can converse using speech in real time, read emotional cues, and respond to visual input Sign Up |Advertise|View Online TLDR Together With
“You can’t do that, it’s illegal!”
Tuesday, May 14, 2024
When LLMs provide lessons in ethics & morals ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
BetterDev #259 - How LLMs Work, Explained Without Math and Turning AirPods into a Fitness Tracker to Fight Cancer
Monday, May 13, 2024
Better Dev #259 May 13, 2024 Hi all, We come back with a new issue this week. If you like BetterDev, please help spead word out by refer to your friends. Buy me a coffee would be great too. Many link
Meet OpenAI’s newest GPT
Monday, May 13, 2024
Plus: White House to fund semiconductors and Cruise tests in Phoenix View this email online in your browser By Christine Hall Monday, May 13, 2024 Good afternoon, and welcome back to TechCrunch PM. We
The Story of Project Management & SEO ruined the internet
Monday, May 13, 2024
My name is Philipp and you are reading Creativerly, the weekly digest about creativity and productivity-boosting tools and resources, combined with useful insights, articles, and findings from the