Tomasz Tunguz - AI Design Patterns
Tomasz TunguzVenture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. AI Design Patterns
As we’ve been researching the AI landscape & how to build applications, a few design patterns are emerging for AI products. These design patterns are simple mental models. They help us understand how builders are engineering AI applications today & which components may be important in the future. The first design pattern is the AI query router. A user inputs a query, that query is sent to a router, which is a classifier that categorizes the input. A recognized query routes to small language model, which tends to be more accurate, more responsive, & less expensive to operate. If the query is not recognized, a large language model handles it. LLMs much more expensive to operate, but successfully returns answers to a larger variety of queries. In this way, an AI product can balance cost, performance, & user experience. The second design pattern is for training. Models are trained with data (which can be real-world & synthetic or made by another machine), then they are sent for evaluation. The evaluation is a topic of much debate today because we lack a gold standard of model greatness. The challenge with evaluating these models is the inputs can vary enormously. Two users are unlikely to ask the same question in the same way. The outputs can also be quite variable, a result of the non-determinism & chaotic nature of these algorithms. Adversarial models will be used to test & evaluated AI. Adversarial models can suggest billions of tests to stress the model. They can be trained to have strengths different to the target model. Just as great teammates & competitors improve our performance, adversarial models play will play that role for AI. The core security around LLMs has two components. A user component, here it’s called a proxy, & a firewall, which wraps the model. The proxy intercepts a user query both on the way out & on the way in. The proxy eliminates personally identifiable information (PII) & intellectual property (IP), logs the queries, & optimizes costs. The firewall protects the model & the infrastructure it uses. We have a minimal understanding of how humans can manipulate models to reveal their underlying training data, their underlying function, & the orchestration for malicious acts today. But we know these powerful models are vulnerable. Other security layers will exist within the stack, but in terms of the query path, these are the most important. The last of our current design patterns in the AI developer design path. The developer’s machine is secured with endpoint detection & response, or EDR, to ensure that the data being used to train models & the underlying models are not poisoned. The developer’s code is sent to a CICD system. The CICD system checks the model & the data are correct using signatures (Sig Verification). Today, most softwares’ signatures are verified. But not AI models. Also, the large language model will be subjected to a testing harness (a series of tests) to ensure that it performs as expected. Real user queries from live traffic will inform the harness. Once those tests pass, the model is pushed to production. These are our four current mental models for how large language models will be built, secured, & deployed. These are sketches of each leg of an elephant we are trying to draw in a dark room. If you have ideas about other design patterns or improvements to the current ones, please contact us. We’d love to improve these to help others. |
Older messages
Standard Issue AI
Friday, February 2, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Standard Issue AI “For some companies, [AI is] going to be
Experience as a Status Symbol
Wednesday, January 31, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Experience as a Status Symbol Last year, I argued every
The Fastest Growing Software Sectors in 2024
Monday, January 29, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Fastest Growing Software Sectors in 2024 The fastest
AI Drove the Largest New Bookings of Any New Product
Friday, January 26, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. AI Drove the Largest New Bookings of Any New Product
Dissecting Delegation: Diving Deep on The Missing B-School Class
Wednesday, January 24, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Dissecting Delegation: Diving Deep on The Missing B-School
You Might Also Like
#211 | The Next Standard Oil, AI SDRs, Four Futures, & more
Sunday, December 22, 2024
Dec 22nd | The latest from IVP, Madrona, Khosla Ventures, Sapphire, USV, and others ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
This might be the simplest $12K/month app ever
Sunday, December 22, 2024
Starter Story Sunday Breakfast ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Behind the founder: Marc Benioff
Sunday, December 22, 2024
Salesforce CEO Marc Benioff on the beginners mind, AI, marketing, leadership, and life lessons from Steve Jobs ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
🎄 𝟖 𝐃𝐚𝐲𝐬 𝐎𝐧𝐥𝐲: 𝟖𝟓% 𝐎𝐅𝐅 + 𝟏 𝐅𝐑𝐄𝐄 𝐂𝐨𝐮𝐫𝐬𝐞!
Sunday, December 22, 2024
This 8-day holiday deal is the perfect gift for your 2025 self! fdrlogo Hey Friend , The holidays are here, and it's time to kickstart your journey toward an incredible 2025! Foundr's 8-Day
What’s 🔥 in Enterprise IT/VC #425
Saturday, December 21, 2024
Will business applications collapse in the agent era? Where does logic go + what are the "second order" effects when dozens or hundreds of agents are running amok at enterprises? ͏ ͏ ͏ ͏ ͏ ͏
🚀 The 2024 Space Stock Rally
Friday, December 20, 2024
Plus Synspective IPO's in Tokyo, $HON exploring strategic alternatives for Aerospace business, and more! The latest space investing news and updates. View this email in your browser The Space Scoop
🚨 Announcing: The inaugural “What’s in your stack?” survey
Friday, December 20, 2024
Tracking the most commonly used (and beloved) tools in tech ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
2025 Predictions
Friday, December 20, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. 2025 Predictions Every year I make a list of predictions &
🗞 What's New: What OpenAI's new o3 model means for indie hackers
Friday, December 20, 2024
Also: AI-powered reading is coming ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
🚨 Our final email. Don’t miss the insights that could change your 2025
Friday, December 20, 2024
24 hours to catch the best moments of the ecommerce summit 2025! Hey Friend , This is it—our final email about this. The full replay of the Start Your Ecommerce Business Summit 2025 will be gone in 24