TheSequence - The Single-Algorithm AI Chip
Plus tremendous funding activity in generative AI startups.
📝 Editorial: The Single-Algorithm AI Chip

The dominance of the transformer architecture in generative AI represents a pivotal moment for the AI chip industry. This revolution has sparked a renaissance in chip design, propelling NVIDIA to become one of the world's most valuable companies and fueling substantial funding for new AI chip startups. The demand for AI hardware seems limitless, driven not only by the rapid pace of AI advancements but also by the slow evolution of AI model architectures beyond transformers.

Simply put, the dominance of transformers as the preferred architecture in generative AI is the best thing to have happened to the AI chip industry. The rationale is clear: when most AI software innovation centers around a single architecture, it becomes logical for AI chip manufacturers to optimize for that paradigm. Given that AI chip production cycles are significantly longer than software development cycles, such optimization is only feasible if model architectures remain stable for years. Conversely, constant changes in architecture paradigms would render AI chip optimization impractical and economically unviable.

Last week provided a notable example of this market dynamic between AI chips and software: Etched, a new AI chip startup, secured $120 million in funding to develop chips specialized in transformer architectures. Etched's chip, Sohu, is capable of processing 500,000 tokens per second when running a Llama 70B model, surpassing NVIDIA's Blackwell (B200) GPUs in speed and cost efficiency. Sohu's specialization in a single algorithm allows for a streamlined logic flow, accommodating more mathematical blocks and achieving an impressive 90% FLOPS utilization. The dominance of the transformer architecture empowers startups like Etched to optimize chip designs and compete effectively with established industry giants.
The greatest paradox of the AI chip renaissance lies in the fact that innovation is spurred not by rapid AI evolution, but by its deliberate pace.

🌝 Recommended – Finally: Instant, accurate, low-cost GenAI evaluations
Why are Fortune 500 companies everywhere switching to Galileo Luna for enterprise GenAI evaluations?
🔎 ML Research

FineWeb
Hugging Face published a paper detailing how they built FineWeb, one of the largest open source datasets for LLM pretraining ever built. FineWeb boasts an impressive 15 trillion tokens from 96 Common Crawl snapshots.

Agent Symbolic Learning
Researchers from AIWaves published a paper introducing a technique known as agent symbolic learning, aimed at enabling agents to self-improve. The core idea is to draw a parallel between an agent pipeline and a neural net and use symbolic optimizers to improve the agent network.

APIGen
Salesforce Research published a paper introducing APIGen, a pipeline designed to synthesize function-calling datasets. APIGen was used to train models with 7B parameters that achieve state-of-the-art results on function-calling benchmarks.

MISeD
Google Research published a paper introducing Meeting Information Seeking Dialogs (MISeD), a dataset focused on meeting transcripts. MISeD targets finding factual information in meeting transcripts, which can be a notoriously difficult task.

Olympic Arena
Researchers from Shanghai Jiao Tong University's Generative AI Research Lab published a paper detailing the results of the Olympic Arena superintelligence benchmark. Olympic Arena was designed to evaluate models across many disciplines and modalities.

Exams for RAG Pipelines
Amazon Science published a paper discussing a technique to evaluate the accuracy of RAG applications. The method mimics an exam generation process based on item response theory.

🤖 Cool AI Tech Releases

MLflow at SageMaker
Amazon is launching support for MLflow in its SageMaker platform.

Multimodal Arena
Chatbot Arena just added support for multimodal models.

Meta LLM Compiler
Meta AI open sourced its LLM Compiler, a family of Code Llama based models with compiler and optimization capabilities.
Character Calls
Character AI introduced Character Calls, a voice interaction experience with Characters.

🛠 Real World AI

Incident Response at Meta
Meta shares some details about their usage of generative AI for incident response.

ETA at Lyft
Lyft discusses the ML techniques used to ensure estimated time of arrival (ETA) reliability for riders.

📡 AI Radar