🎙 Ran Romano/Qwak about bridging the gap between data science and ML engineering
Was this email forwarded to you? Sign up here It’s so inspiring to learn from practitioners and thinkers. Getting to know the experience gained by researchers, engineers, and entrepreneurs doing real ML work is an excellent source of insight and inspiration. Share this interview if you like it. No subscription is needed. 👤 Quick bio / Ran Romano
Ran Romano (RR): My name is Ran Romano. I am married + 2 and live in Israel. I started my professional path at the intelligence cores of the Israel Defence Forces (IDF). I was always curious about the relationship between engineering and data science and the impact of merging both disciplines to deliver complete solutions. Later, during my time at Wix, I got the opportunity to feed my curiosity by founding and leading the team responsible for building Wix’s Internal ML platform, which is designed to allow efficient and continuous delivery of ML models to production. Nowadays, I lead Qwak’s engineering & product teams and leverage my experience and domain expertise in order to build a world-class ML Engineering platform. 🛠 ML Work
RR: Qwak, at its core, enables data scientists and ML engineers to build, deploy, maintain and monitor ML models & features in production with minimal engineering friction. One of our main pillars, and an approach we highly advocate for, is a production-first approach for applying ML models to business needs. DS teams should not spend months iterating on models within their playground but strive to deploy and test their models as fast as possible in the wild and iterate quickly while in production, having their model already delivering value. So we’ve built our platform around the notion of decoupling DS teams from their engineering counterparts in order to allow them to run experiments faster and more independently. We want them to be able to work and not have to go back and forth continuously and waste precious time and resources.
RR: Qwak build system adds “traditional” build processes to machine learning models and allows data scientists to build an immutable, versioned, and tested production-grade artifact. Our build system standardizes an ML project structure that automatically versions data, code, and parameters for every model build. The Qwak build system takes its inspiration from traditional build tools while at the same time augmenting ML-specific capabilities like experiment tracking, code parameters, and data versioning. The great thing about the build system is that it leverages best practices forged over the past decade in the DevOps world and applies them to the ML engineering world.
RR: One of the key challenges we observe in ML serving space is the attempts to deploy hundreds or even thousands of similar purpose models simultaneously in a manageable fashion. In most cases, that means training the same model but for hundreds or thousands of different datasets. For example, a B2B company with multiple customers would be better off training a model per customer rather than training a single generic model for all customers alike. For that end, Qwak provides “Model Per Dimension” type deployments which enable a cost-efficient, scalable, and manageable solution to deploy large numbers of models. Qwak Serving allows deployment of any type of model to production with, in most cases, a single click, aiming to reduce the friction between data science and engineers. At its core, it simply provides a faster way to put your models into production. The serving mechanism enables teams to deliver prediction services in a fast, repeatable, and scalable way – including advanced metrics, logging, and alerting capabilities. One of the main advantages of our build system is that its product, a deployable artifact, is completely deployment agnostic. Thus allowing a seamless deployment experience to either batch, real-time, or streaming inference. This means the engineer does not have to stick to a single deployment method which is usually a business-related requirement.
RR: We see it as a classic problem of best-of-breed vs. best-of-suite. With a best-of-breed approach, you get a composable architecture that could potentially increase the flexibility of your home-grown ML Platform, however at the cost of needing to maintain much “glue code” in order to piece together different components which are naturally not designed to work together as a single coherent platform. And we see a trend of these single silos companies expanding into other domains in the ML pipeline to offer a complete solution. On the other hand, cloud providers tend to deliver an all-encompassing toolchain that covers the entire ML domain for feature engineering to AutoML solutions without any specific focus. This tactic, as I learned firsthand from my time in Wix, of providing rudimentary building blocks forces ML teams (like my team at Wix) to build their own infrastructure on top of the ML infrastructure the cloud providers provide. 💥 Miscellaneous – a set of rapid-fire questions
Simpson’s paradox - even though it’s not a real paradox :)
Not just for ML Engineers - Principles by Ray Dalio.
To a much lesser extent than in the past. I like the approach of the Marcus test.
No You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
🤷🏻 Edge#211: What to Test in ML Models
Tuesday, July 26, 2022
In this issue: we discuss what to test in ML models; we explain how Meta uses A/B testing to improve Facebook's newsfeed algorithm; we explore Meta's Ax, a framework for A/B testing in PyTorch.
🗄 A Model Compression Library You Need to Know About
Sunday, July 24, 2022
Weekly news digest curated by the industry insiders
📌 Event: Join us for this live webinar to learn how Tide reduced model deployment time by 50%!
Friday, July 22, 2022
A real use case you don't want to miss!
🟢⚪️ Edge#210: Hopsworks 3.0, Connecting Python to the Modern Data Stack
Thursday, July 21, 2022
On Thursdays, we deep dive into one of the freshest research papers or technology frameworks that is worth your attention. Our goal is to keep you up to date with new developments in AI and introduce
📌 Event: Join us at The Future of Data-Centric AI 2022 — a free virtual event by Snorkel AI
Wednesday, July 20, 2022
We're excited to partner with Snorkel AI on The Future of Data-Centric AI, a free two-day virtual event on August 3-4 that will cover the latest data-centric approaches to AI application
You Might Also Like
The ultimate holiday gadget gift
Monday, November 25, 2024
AI isn't hitting a wall; $70 off Apple Watch; 60+ Amazon deals -- ZDNET ZDNET Tech Today - US November 25, 2024 Meta Quest 3S Why the Meta Quest 3S is the ultimate 2024 holiday present This $299
Deduplication in Distributed Systems: Myths, Realities, and Practical Solutions
Monday, November 25, 2024
This week, we'll discuss the deduplication strategies. We'll see whether they're useful and consider scenarios where you may need them. We'll also do a reality check with the promises
How to know if your data has been exposed
Monday, November 25, 2024
How do you know if your personal data has been leaked? Imagine getting an instant notification if your SSN, credit card, or password has been exposed on the dark web — so you can take action
⚙️ Amazon and Anthropic
Monday, November 25, 2024
Plus: The hidden market of body-centric data
⚡ THN Recap: Top Cybersecurity Threats, Tools & Tips (Nov 18-24)
Monday, November 25, 2024
Don't miss the vital updates you need to stay secure. Read the full recap now. The Hacker News THN Recap: Top Cybersecurity Threats, Tools, and Practices (Nov 18 - Nov 24) We hear terms like “state
Researchers Uncover Malware Using BYOVD to Bypass Antivirus Protections
Monday, November 25, 2024
THN Daily Updates Newsletter cover Generative AI For Dummies ($18.00 Value) FREE for a Limited Time Generate a personal assistant with generative AI Download Now Sponsored LATEST NEWS Nov 25, 2024 THN
Post from Syncfusion Blogs on 11/25/2024
Monday, November 25, 2024
New blogs from Syncfusion Build World-Class Flutter Apps with Globalization and Localization By Lavanya Anaimuthu This blog explains the globalization and localization features supported in the
Is there more to your iPhone?
Monday, November 25, 2024
Have you ever wondered if there's more to your iPhone than meets the eye? Maybe you've been using it for years, but certain powerful features and settings remain hidden. That's why we'
🎉 Black Friday Early Access: 50% OFF
Monday, November 25, 2024
Black Friday discount is now live! Do you want to master Clean Architecture? Only this week, access the 50% Black Friday discount. Here's what's inside: 7+ hours of lessons .NET Aspire coming
Open Pull Request #59
Monday, November 25, 2024
LightRAG, anything-llm, llm, transformers.js and an Intro to monads for software devs ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏