A New Compute Platform for Generative AI?
Is generative AI big enough to spark the creation of a new compute platform?
📝 Editorial: Would Generative AI Require New Hardware Platforms?
One of the best-known secrets in tech investing is that any sufficiently large tech trend can create a new computing platform. The advancements in microprocessor design in the late '70s sparked the creation of personal computing. The evolution of the internet enabled the creation of the web browser in the '90s. Similarly, advancements in mobile computing led to the smartphone revolution a few years ago. You can also make the case that devices like Alexa or Google Home have become new compute platforms on a smaller scale. Will generative AI trigger the creation of a new compute platform?
The answer is far from trivial and depends on the balance between the revolutionary impact of generative AI and the footprint of existing computing platforms. Because the technology market has grown at an exponential clip, each new generation of compute platforms needs a bigger effort to disrupt incumbents with a well-established footprint. In the case of generative AI, would a new compute platform need to capture a significant percentage of use cases that won't take place on existing platforms such as web browsers, smartphones, or home devices?
Over the last few months, we have seen early efforts such as OpenAI working with famed designer Jony Ive to design a new device for generative AI. Other initial efforts, such as the Humane AI Pin, are showcasing new interaction paradigms with generative AI. One of the most interesting announcements at last week's CES came from Rabbit, with a new generative AI device called R1. The device has a minimalist design that includes a 2.88-inch touchscreen, a rotating camera for taking photos and videos, and a scroll wheel. However, the most intriguing feature of R1 is that it relies on a new foundation model based on the Large Action Model (LAM) paradigm.
LAMs are language models optimized for performing actions on external systems. Rabbit seems to have developed an entire tech stack, dubbed Rabbit OS, for the development of LAMs. The release of R1 is one of the most complete examples of how a new platform for generative AI could look. A new compute platform for generative AI is a seductive idea but also a massive undertaking.
🔎 ML Research
Fair LLM Serving
A group of researchers from UC Berkeley, Stanford University, and Duke University published a paper proposing a technique for fairness in LLM serving. Specifically, the algorithm, called Virtual Token Counter (VTC), uses a cost function based on the number of input and output tokens.
TinyLlama
Researchers from the StatNLP Research Group, Singapore University of Technology and Design, and others published a paper unveiling TinyLlama, a 1.1B-parameter LLM pretrained on one trillion tokens. TinyLlama shows the potential of small LLMs by performing well across different tasks.
Diffusion DPO
Salesforce Research published a paper detailing Diffusion DPO, which streamlines the adoption of human feedback in text-to-image models. Diffusion DPO incorporates the efficient Direct Preference Optimization (DPO) training method into text-to-image models.
CosMo
Researchers from Microsoft and the National University of Singapore published a paper detailing CosMo, a new pretraining method for vision-language models. The method works efficiently for both image and video models.
Responsible AI
Microsoft Research published a series of papers outlining its latest work in responsible AI. The collection covers areas such as privacy, testing, human feedback, transparency, and several others.
🤖 Cool AI Tech Releases
r1
The launch of Rabbit's r1 AI device was one of the most interesting highlights of CES.
GPT Store
OpenAI unveiled a store for custom versions of ChatGPT.
🛠 Real World ML
Ad Optimization at Pinterest
Pinterest discusses the ML architecture powering its ad optimization infrastructure.
The ML Behind the New York Times Crossword
The New York Times shares some details of the handwriting recognition models behind its famous crossword puzzle.
📡 AI Radar