Barak Turovsky (Exec In Residence, Scale Venture Partners): How to Evaluate Generative AI Use Cases
Why search might be a red herring and who will capture the most value in the AI stack
Dear subscribers,
Today, I want to share a great framework for evaluating generative AI use cases. Barak Turovsky is Executive in Residence at Scale Venture Partners and ex-head of product for Google Languages AI. I worked with Barak a decade ago, so naturally I had to chat with him about AI. In the interview below, we talk about:
How to evaluate generative AI use cases
Welcome Barak! What’s your framework for evaluating generative AI use cases?
I like to evaluate use cases across two axes:
Here’s the breakdown:
Makes sense. Just the other day I was using ChatGPT to clean up some data and it started making up numbers halfway through! Are there scenarios where generative AI can still help with high-accuracy use cases?
Yes, it can still help if you have a person manually checking the AI’s output. For example, ChatGPT can write the first draft of your business email for you to edit. But manual review breaks down in high-volume use cases like search, where it becomes impossible to have a trained person validate every result.
Why search might be a red herring vs. other use cases for large language models
Both Microsoft and Google are racing to use generative AI and large language models (LLMs) to improve search. Can you describe the types of search results that might be a good fit for these AI models?
Yes, let’s use the same framework above to break down search:
That matches my personal experience with Bing AI. Over time, I found myself using it for creator use cases (e.g., “Make this content more clear and concise.”)
Yes, a third axis of the framework above is how high the stakes are.
Imagine the LLM gave you a Disneyland itinerary that recommended a subpar hotel or a restaurant that’s actually closed. You would be pretty mad.
Yeah, despite the hype I don’t think anyone will blindly trust LLMs to book travel or restaurants right now. How fast will LLMs fix the hallucination problem?
I think it might be hard to improve accuracy. LLMs like GPT-4 already have an enormous number of parameters, so there are diminishing returns to building bigger models. The main value of LLMs is fluency with “good enough” accuracy. Even at 80-90% accuracy, LLMs could disrupt a wide variety of industries.
Which markets do you think LLMs would disrupt first?
So we discussed how LLMs are ideal for creator and productivity use cases. Here are three other markets that are ripe for disruption:
Which companies might capture the most value in the AI stack
Can you describe the generative AI stack and which layer might capture value?
At a high level, there are three layers:
In terms of which layer will capture the most value, my bet is on the infra layer:
Applications, on the other hand, are risky. There’s a joke that many AI apps are just wrappers around OpenAI’s APIs. How do you build a moat in the application layer?
I think AI apps can build moats in a few ways:
Any closing thoughts on generative AI?
We’re still in the early stages of the AI revolution, but I think one thing is clear: companies need to think about how they can use generative AI to enhance their product or they’ll risk getting left behind. As with every new tech, productizing generative AI is both exciting and scary. I’m excited to see more companies use the framework we discussed to cross the chasm.
Thank you Barak! If you enjoyed this conversation, please follow Barak on LinkedIn.