The Sequence Chat: Why Transformers are the Best Thing that Ever Happened to NVIDIA
A discussion about some controversial and original ideas in AI.

I want to devote some installments of The Sequence to reflections on several controversial ideas around AI. After all, in a market flooded with hundreds of AI newsletters, one of the rarest things to find is a publication that discusses original ideas. I think this section will be a cool complement to our interview series and, if nothing else, might force you to think about these topics even if you disagree with my opinion 😉

Today, I would like to start with a simple but controversial thesis that I was discussing with some of my students recently: the transformer architecture used in foundation models is, arguably, the best thing that ever happened to NVIDIA. Have you ever heard the phrase that the only company turning real profits in AI is NVIDIA? Well, transformers have a lot to do with that. The main reasons are both technical and market related:
Let’s dive into these two points:...