The Promise and Pitfalls of Chaining Large Language Models for Email
Tomasz TunguzVenture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. The Promise and Pitfalls of Chaining Large Language Models for Email
Over the last few weeks I’ve been experimenting with chaining together large language models. I dictate emails & blog posts often. Recently, I started using Whisper for drafting emails and documents. (Initially there were some issues with memory management, but I’ve since found a compiled version that works well on my Mac called whisper.cpp) After tying Google’s Duet I wondered if I could replicate something similar. I’ve been chaining the Whisper dictation model together with LLaMA 2 model from Facebook. When drafting an email, I can dictate a response to LLaMA 2, which will then generate a reply using the context from my original email. So far it works sometimes, but there are some clear limitations: First, the default tone of the generated emails is far too formal. [Link to example] Second, if I prompt LLaMA 2 to use a more casual tone, it often goes too far in the other direction. The problem is a lack of nuanced context - the appropriate level of familiarity varies greatly between emails to close colleagues versus board communications or potential investors. Without that nuance labeled and incorporated into the training data, it’s hard for the model to strike the right tone. Third, in multi-party email threads things can get confusing. If Lauren introduces Rafa to me, then Rafa bccs Lauren on the email, LlaMA 2 often replies as Lauren. Fourth, figuring out exactly the right settings for the model can be tough. Sometimes I dictate long emails, in which case the context windows (how much the computer listens to before transcribing) should be very long so the system can remember what I’ve said previously. Other times I’m just returning a very fast email. A quick see you soon or thank you very much. In which case a long context window doesn’t make sense and I’m left waiting for the system to process. I’m wondering whether small errors in the first model compound in the second model. Bad data from the transcription -> inaccurate prompt to the LLM -> incorrect output. I’m loo Overall the potential is exciting, but there are still challenges around tone, context, and multi-party interactions that need to be addressed before this can become a seamless productivity tool. Tn machine learning systems, achieving an 80% solution is pretty rapid. The marginal 15% - the magic behind ML - takes a huge amount of effort, data, & tuning. |
Older messages
SaaS Competitive Advantage Through Elegant LLM Feedback Mechanisms
Tuesday, October 3, 2023
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. SaaS Competitive Advantage Through Elegant LLM Feedback
Centaurs & Cyborgs : The Jagged Frontier of AI
Monday, October 2, 2023
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Centaurs & Cyborgs : The Jagged Frontier of AI Last week,
Artisanal Emails
Wednesday, September 27, 2023
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Artisanal Emails Artisanal chocolate, artisanal candles,
Avoiding the PLG Trap : Office Hours with Oliver Jay
Monday, September 25, 2023
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Avoiding the PLG Trap : Office Hours with Oliver Jay On
Building Applications with AI - Lessons from LangChain, Hearth, & Context.ai
Thursday, September 21, 2023
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. Building Applications with AI - Lessons from LangChain, Hearth,
You Might Also Like
🚀 Ready to scale? Apply now for the TinySeed SaaS Accelerator
Friday, February 14, 2025
What could $120K+ in funding do for your business?
📂 How to find a technical cofounder
Friday, February 14, 2025
If you're a marketer looking to become a founder, this newsletter is for you. Starting a startup alone is hard. Very hard. Even as someone who learned to code, I still believe that the
AI Impact Curves
Friday, February 14, 2025
Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here. AI Impact Curves What is the impact of AI across different
15 Silicon Valley Startups Raised $302 Million - Week of February 10, 2025
Friday, February 14, 2025
💕 AI's Power Couple 💰 How Stablecoins Could Drive the Dollar 🚚 USPS Halts China Inbound Packages for 12 Hours 💲 No One Knows How to Price AI Tools 💰 Blackrock & G42 on Financing AI
The Rewrite and Hybrid Favoritism 🤫
Friday, February 14, 2025
Dogs, Yay. Humans, Nay͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
🦄 AI product creation marketplace
Friday, February 14, 2025
Arcade is an AI-powered platform and marketplace that lets you design and create custom products, like jewelry.
Crazy week
Friday, February 14, 2025
Crazy week. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
join me: 6 trends shaping the AI landscape in 2025
Friday, February 14, 2025
this is tomorrow Hi there, Isabelle here, Senior Editor & Analyst at CB Insights. Tomorrow, I'll be breaking down the biggest shifts in AI – from the M&A surge to the deals fueling the
Six Startups to Watch
Friday, February 14, 2025
AI wrappers, DNA sequencing, fintech super-apps, and more. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
How Will AI-Native Games Work? Well, Now We Know.
Friday, February 14, 2025
A Deep Dive Into Simcluster ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏