🎙 Yinhan Liu/CTO of BirchAI about applying ML in the healthcare industry
It’s so inspiring to learn from practitioners. Getting to know the experience gained by researchers, engineers, and entrepreneurs doing real ML work is an excellent source of insight and inspiration. Share this interview if you find it enriching. No subscription is needed.
👤 Quick bio / Yinhan Liu
Yinhan Liu (YL): I started my undergrad as a Chemical Engineering major and added a math major – not focused at all on CS. I didn’t get my start in the field until I took an ML class during my first semester of grad school, which inspired me to spend a lot of personal time reading AI-related papers. I eventually made my way to Facebook AI Research, where I had the opportunity to work with some great people at an important time in NLP history. But, while I enjoyed the research side of things, I wanted to have a more direct impact on people. So, I decided to co-found BirchAI at AI2 with trusted colleagues I had known for 5 to 10 years. I’m now its CTO, leading Engineering and Science.
🛠 ML Work
YL: BirchAI is focused on applying AI to complex audio processes in healthcare – an area that Sumant (COO), Kevin (CEO), and I have been thinking about for a long time. There’s much more beyond this, but our initial focus is on automating complex After Call Work in healthcare call centers – think of a patient calling in about an issue with her pacemaker. The healthcare industry faces several related business challenges that drive our ML challenges. For example, humans vary a lot in how they understand, classify, and summarize detailed healthcare conversations. For BirchAI, that means that IF the data is labeled, it is usually labeled poorly. We have developed effective workarounds that have allowed us to achieve very high accuracy at scale. That leads us to another point: the notion of “Explainable Human”. Many customers initially maintain that their call center teams already achieve consistency and accuracy of 98 or 99%. Invariably, we see that this is not true. Companies think they know how employees are doing the work, but that belief is based on crude, low-volume, manual sampling methods of Quality Assessment that fail to capture the semantic richness of conversations at scale and how that dialogue should be characterized. The BirchAI product highlights this variance and gives us the means to drive and maintain consistency and accuracy at a previously unattainable scale. Healthcare companies spend tens of billions of dollars trying to address these questions – we are addressing them at scale.
YL: Our first challenge is that our data is not labeled – and large-scale pre-trained models do not work out of the box. We have built a complex AI-based pipeline to label data we use to train at scale and then reach a high degree of accuracy. Another challenge is that these problems cannot be met with a single module – so we’ve used a multi-modal approach to create a robust pipeline of models for our product.
YL: Previous NLP technology was essentially as developed as it was going to get, yet it was not accurate or robust enough to meet customer needs for most healthcare use cases. As a result, many processes are still done manually. But pre-trained models with a transformer architecture now provide a higher performance starting point, and there is much more to be discovered. We intimately understand those opportunities in areas like voice and document AI, and we are actively exploiting those to build game-changing products in healthcare.
YL:
1. The first big problem we needed to overcome was Speech to Text – we have found that the off-the-shelf APIs do not provide a good enough input for our downstream models. That’s why we built our own STT that consistently outperforms the other STT models we can see. Of course, we will continue to improve this model, which is flexible enough to allow that.
2. Another problem has been how to optimize our models. We are not a consulting shop – we are a product company. How do we maximize production performance with the fewest possible models? For example, we have a large medical device customer with a single, high-quality dialogue summarization model working across four different products. We are starting to deploy that and are excited to see how we can extend that across all their products.
3. At the core of our capabilities has been the ability to use AI itself to create high-quality, large-scale, labeled data. This is similar in concept to back-translation, where we use AI models to create labels at scale, and we then train other AI models using those labels and other data. We’ve had great success with this and see many possibilities for the approach.
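To make the third point concrete, here is a minimal sketch of the general idea of model-generated labels (often called pseudo-labeling): one model labels raw text, only its confident labels are kept, and a second model is trained on them. This is not BirchAI's pipeline, which the interview does not describe in detail. The keyword-based "teacher", the label names (`device_issue`, `billing`), the confidence threshold, and the sample transcripts are all hypothetical stand-ins chosen so the example runs with only the Python standard library.

```python
import math
from collections import Counter, defaultdict

# Hypothetical "teacher": a keyword rule standing in for a stronger AI model.
TEACHER_KEYWORDS = {
    "device_issue": {"pacemaker", "battery", "beeping"},
    "billing": {"invoice", "charge", "refund"},
}


def teacher_label(text):
    """Return (label, confidence) for a transcript, or (None, 0.0) if unsure."""
    tokens = set(text.lower().split())
    scores = {label: len(tokens & kws) for label, kws in TEACHER_KEYWORDS.items()}
    total = sum(scores.values())
    if total == 0:
        return None, 0.0
    best = max(scores, key=scores.get)
    return best, scores[best] / total


def build_pseudo_labels(unlabeled, min_confidence=0.8):
    """Keep only the examples the teacher labels with high confidence."""
    labeled = []
    for text in unlabeled:
        label, confidence = teacher_label(text)
        if label is not None and confidence >= min_confidence:
            labeled.append((text, label))
    return labeled


class NaiveBayesStudent:
    """Tiny multinomial naive Bayes classifier trained on the pseudo-labels."""

    def fit(self, examples):
        self.word_counts = defaultdict(Counter)
        self.label_counts = Counter()
        self.vocab = set()
        for text, label in examples:
            tokens = text.lower().split()
            self.word_counts[label].update(tokens)
            self.label_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        tokens = text.lower().split()
        n_examples = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            total = sum(self.word_counts[label].values())
            score = math.log(count / n_examples)
            for tok in tokens:
                # Laplace smoothing so unseen words do not zero out a class.
                seen = self.word_counts[label][tok]
                score += math.log((seen + 1) / (total + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up, unlabeled call snippets for illustration only.
unlabeled_calls = [
    "my pacemaker keeps beeping at night",
    "the battery on the pacemaker seems low",
    "i was sent an invoice with a wrong charge",
    "please refund the duplicate charge",
    "hello can you hear me",
]

pseudo_labeled = build_pseudo_labels(unlabeled_calls)
student = NaiveBayesStudent().fit(pseudo_labeled)
prediction = student.predict("the device is low on battery at night")
print(len(pseudo_labeled), prediction)
```

The student can generalize slightly beyond the teacher's rules: the last transcript contains only one teacher keyword, yet the student also draws on surrounding words it saw in confidently labeled calls. In a real system the teacher would be a far stronger model and the filtering step would need careful validation.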
YL: It’s not enough to create a huge model that uses massive amounts of expensive computing to create a result out in dev. We’ve been lucky enough to recruit a great founding engineer, Gaurav Shegokar, who really understands how to optimize inference time and accuracy – or infrastructure and performance. That blend of traditional software and AI at scale is a key characteristic we look for in new engineering hires.
💥 Miscellaneous – a set of rapid-fire questions
Twin Primes.
Introduction to Statistical Learning. It tells you everything you need to get started!
We use a bit of a Turing Test approach when we show people our dialogue summaries. After more than 50 interactions, more people identify our BirchAI-generated summary as created by a human than the correct one. So yes, I guess it is still relevant.
No, I don’t believe so.