Tomasz Tunguz - The Challenge of the AI Demo
Tomasz TunguzVenture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Challenge of the AI Demo
The AI Demo isn’t easy. Many of the major AI companies have demoed their AI systems, first starting with pre-recorded, & now pushing into live demos. They don’t always work. Multiply Murphy’s Law by a non-deterministic system & it’s not unreasonable to expect AI demos to nearly always hiccup. Demo disruptions aren’t disaster. These systems are early & changing rapidly. They might suggest the system requires work & tuning, not a fundamental challenge. But, they can be problematic in proofs-of concept. Proofs of concept are extended demonstrations of the software. Well-structured PoCs align on success criteria at the outset. These criteria enable vendors & customers to agree on what success looks like. Worflow proofs-of-concept are relatively straightforward. They are deterministic. Can I process a loan application in 5 minutes? Yes or no. But as AI applications shift to selling outcomes implicitly or explicitly, the PoC becomes a testing ground of those outcomes. Non-determinism means sometimes the PoC won’t produce the required wow moment. This also means the PoC criteria must be more flexible. How does a buyer evaluate a probabilistic system? Do we compare it to human performance? Speaking to some practitioners, they’ve shared with us human labelers typically agree on 60-70% of the time. Does a AI robot need to be as accurate as a human assuming it will be much less expensive? Or will we expect more as we do in self-driving cars? If AI systems require human assistance, then the ROI of the system must include some human operating expense - whether explicit or implicit. Some teams will want to benchmark systems in parallel to determine the relative performance. With most startups building atop existing models & setting aside differences in fine-tuning, the ultimate performance should be relatively comparable, provided they use the same data sets. Will startups compete on access to different data sets? Today, there are more questions than answers about how to sell AI agent systems. We’re hosting an event on the evening of Sep 10th in San Francisco to interview leaders in the space moderated by Dave Morse, former CRO at Hebbia & VPS/VPCS at ScaleAI to talk about some of these questions. If you’re interested to attend, see the details here. |
Older messages
Which Design Era Are We In?
Tuesday, September 3, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Which Design Era Are We In? When the internet became popular
The Dislocation Between Public & Private Web3 Markets
Friday, August 30, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Dislocation Between Public & Private Web3 Markets With
Higher Levels of Abstraction
Monday, August 26, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Higher Levels of Abstraction Over the weekend, Andrej Karpathy
What Has Your GPU Done For You Today?
Friday, August 23, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. What Has Your GPU Done For You Today? A year ago, enterprises
Things that Used to be Impossible, but are Now Really Hard
Tuesday, August 20, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Things that Used to be Impossible, but are Now Really Hard
You Might Also Like
75 Cents per Month
Monday, November 18, 2024
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. 75 Cents per Month What it cost to have an assistant with you
Questions About Outbound Lead Gen?
Monday, November 18, 2024
You are Invited: AMA with Chris Marin, CEO at Convert.AI
🦄 Streamlined authentication for financial institutions
Monday, November 18, 2024
Illuma provides voice authentication and fraud prevention for credit unions and community banks.
12 Silicon Valley Startups Raised $508.8 Million - Week of November 18, 2024
Monday, November 18, 2024
💰 It's 'Liquidity, Stupid' 🤝🏻 US, China's AI Codependency 🎤 Roadmap: Voice AI 💰 Why Economists Hate Trump's Tariff Plan ⚔️ What Elon Musk Wants From Donald Trump ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
😱 The lawsuit that nearly destroyed Foundr
Monday, November 18, 2024
That nightmare turned into my most important lesson ever! Black Friday_Header_2 Hey Friend , Today, I want to share something personal about how Foundr came to be. A story that taught me one of the
Cédric O’s new gig
Monday, November 18, 2024
Plus: Bristol-based startups to watch; latest deals View in browser Sponsor Card - Up Round-27 Good morning there, Around this time last year, France's former digital minister-turned-Mistral-
💄⏰ 6 hours left - last chance for the biggest opportunity
Sunday, November 17, 2024
Friend , you're so close to the start of something incredible... Hey Friend , Less than 6 hours left to jump into How to Build a Million Dollar Beauty Brand with Alicia Scott. Before you decide, I
🌅 Golden Age of Building
Sunday, November 17, 2024
Now is the best time in history to be a builder. Let's build things to make the country better. This Week at YC November 17th, 2024 ✨ The Latest Request For Startups Now is the best time in history
🔴 12 hours left - final notice for Friend
Sunday, November 17, 2024
This expires soon and you will miss out. Build a beauty brand you're proud of. Hey Friend , What would it mean to you to finally see your beauty brand dream come to life? To take that idea in your
#206 | Intelligent Automation, Request for Startups, & more
Sunday, November 17, 2024
Nov 17th | The latest from A16Z, Y Combinator, Scale Venture Partners, Homebrew, and others ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏