🟢⚪️ Edge#202: How to Ship ML-powered Apps with Baseten
Was this email forwarded to you? Sign up here 🟢⚪️ Edge#202: How to Ship ML-powered Apps with BasetenBuilding a performant model is just the start, what to do next?On Thursdays, we do deep dives into one of the freshest research papers or technology frameworks that is worth your attention. Our goal is to keep you up to date with new developments in AI and introduce to you the platforms that deal with the ML challenges. 💥 Deep Dive: How to Ship ML-powered Apps with BasetenIt's easier than ever to build machine learning (ML) models. With libraries like TensorFlow, Keras, Scikit-learn, and PyTorch, almost anyone with some basic coding skills can put together a model in a matter of days or weeks. But building a performant model is just the start. The challenge is delivering that model as a production-ready solution. This involves an entirely different set of skills and tasks, from standing up infrastructure to designing business-facing UI and everything in between. In this deep dive, we will see how Baseten, an end-to-end platform for delivering ML models as production-ready solutions, deals with these challenges. Model development vs. model deploymentThe first era of artificial intelligence (AI) engineers didn't have ML libraries, no-code platforms, or AutoML tools. They had to write their own algorithms from scratch. Deployment was similarly complex, requiring custom hardware and software. More recently, model development has become a far easier task. But model deployment has become more complex, with the need for highly robust, scalable solutions that can handle millions of predictions per day. The difference between these two areas is crucial to understand. Model development is typically well-understood by data scientists, but models need to be packaged and integrated as a software component, which demands an entirely different skillset. As a result, even simple models can take over six months and multiple engineering teams to deliver. Broadly speaking, there are five stages to delivering a machine learning solution:
Once data is prepared, training a model is an iterative, scientific process that's right in the wheelhouse of data scientists. Deploying a model is where things start to get complicated. This is the domain of software engineers, who need to wrap the model in a robust, scalable solution that can handle many predictions. Deployment includes containerizing the model in tools like Docker and standing up Kubernetes to manage those containers. Ideally, you’re also ensuring that your deployment pipeline has version control, dependency management, and debuggability to set your team up for success as it scales. Along with deployment, some backend work is required to handle pre- and post-processing at the time of inference. And to integrate the model into your broader stack, additional backend services are needed to call predictions and push predictions to other data stores and tools. Finally, if your model is intended for business users, you’ll need to build a frontend for teams to access and take action on predictions. This involves building a frontend view in HTML & CSS. The role of the data scientistIn most organizations, data scientists are responsible for data preparation and model building. After that, the lines start to blur. In some cases, data scientists hand off the model to an engineering team for deployment. In others, data scientists work with engineers throughout the entire process. Handling all aspects of an ML solution is a huge undertaking that requires many different skills. As a result, it's important to have a clear understanding of the role of the data scientist in the model delivery process. How Baseten solves these challengesWhat we like about Baseten is that it is built specifically for data scientists. This means you don't need to know much about MLOps, backend or frontend development in order to use it. Think about Baseten as an end-to-end platform for delivering machine learning models as production-ready solutions. Data scientists can deploy a model behind an API with a few lines of Python code right from one's Jupyter notebook. Conveniently enough, there's no need to learn new frameworks or toolkits. If you need more than an API, Baseten also makes it easy to integrate with other services and data stores. A simple drag-and-drop interface lets you design full-stack, interactive views for business users. And when it's time to ship your application, you can share a live link to your web app in a few clicks. Baseten comes with many models out of the box for a wide variety of tasks, for example:
And because Baseten is library-agnostic, you can deploy your models from TensorFlow, Scikit-learn, PyTorch, or your custom framework of choice. Worklets and Blocks to Create ApplicationsBaseten’s application logic is done through Worklets, which are visual representations of code and model execution backed by an API endpoint. You can think of it like a DAG. Baseten’s visual approach allows the data scientist to focus on their application’s business logic and flow, like "Classify an image" or "Detect objects in an image,” without needing to worry about infrastructure and DB instances. Worklets are composed of blocks, where each block acts as a work unit, helping keep code organized and easily understandable. For example, data scientists can add actions like “Invoke Model” or “Send a Slack Message” using Baseten’s pre-built blocks. For more specific tasks, you can write custom Python code. Teams can also test and debug code, track run logs, and connect and query from data stores – all from one central environment. Empowering all sorts of teams with MLData science is no longer the domain of a few PhDs in the basement. With the rise of tools like Baseten, organizations can more easily deploy and realize the impact of machine learning models across departments. After all, the use-cases of AI in business applications are very varied. Marketing teams, for instance, are facing steep competition in the digital landscape, limited budgets, and short timeframes to show results. They need all the help they can get to reach and engage their audiences. Machine learning can be used to detect user behavior patterns, predict which products or services a customer is most likely to buy, and build dynamic marketing lists for more effective campaigns. Sales teams are another department that can benefit from ML. With a never-ending stream of leads, it's brutally hard for sales reps to follow up with every single one. Machine learning can be used to automatically prioritize and score leads, so that reps can focus their time on the most promising prospects. Meanwhile, financial teams are under constant pressure to detect and prevent fraud. ML can help with real-time fraud detection, protecting organizations from costly losses. Customer support teams are also turning to ML to build chatbots that can handle routine inquiries and free up time for agents to focus on more complex issues. ConclusionThese are just a few examples of how machine learning can be used to empower teams across an organization. It’s one thing to build a model, but you need a reliable platform to deploy models for these and other tasks without needing to worry about the infrastructure or MLOps. Ultimately, if you're looking for a platform to help you deploy machine learning models, Baseten is worth considering. It's easy to use, fast to get going, library-agnostic, and provides a complete solution for shipping full-stack applications. You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
🎙 Google’s Allen Day on Using ML in the Cryptocurrency Space
Wednesday, June 22, 2022
It's so inspiring to learn from practitioners and thinkers. Getting to know the experience gained by researchers, engineers, and entrepreneurs doing real ML work is an excellent source of insight
Sign in to TheSequence
Tuesday, June 21, 2022
. Here's a link to sign in to TheSequence. This link can only be used once and expires after 24 hours. Sign in now © 2022 Jesus Rodriguez, Ksenia Semenova 75 Miracle Mile, Suite 7688, Coral Gables,
💠 Edge#201: Understanding Graph Convolutional Neural Networks
Tuesday, June 21, 2022
In this issue: we explain Graph Convolutional Neural Networks; we overview the original GCN Paper; we explore PyTorch Geometric, one of the most complete GNN frameworks available today. Enjoy the
📌 Event: Explore the future of scalable AI & more at Ray Summit: August 23-24 in San Francisco!
Monday, June 20, 2022
Must-attend
🔵⚪️ The Alexa Factor
Sunday, June 19, 2022
Weekly news digest curated by the industry insiders
You Might Also Like
Daily Coding Problem: Problem #1619 [Hard]
Monday, November 25, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given two non-empty binary trees s and t , check whether tree t has exactly the
Unpacking “Craft” in the Software Interface & The Five Pillars of Creative Flow
Monday, November 25, 2024
Systems Over Substance, Anytype's autumn updates, Ghost's progress with its ActivityPub integration, and a lot more in this week's issue of Creativerly. Creativerly Unpacking “Craft” in the
What Investors Want From AI Startups in 2025
Monday, November 25, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 25, 2024? The HackerNoon
GCP Newsletter #426
Monday, November 25, 2024
Welcome to issue #426 November 25th, 2024 News LLM Official Blog Vertex AI Announcing Mistral AI's Large-Instruct-2411 on Vertex AI - Google Cloud has announced the availability of Mistral AI's
⏳ 36 Hours Left: Help Get "The Art of Data" Across the Finish Line 🏁
Monday, November 25, 2024
Visual Capitalist plans to unveal its secrets behind data storytelling, but only if the book hits its minimum funding goal. View Online | Subscribe | Download Our App We Need Your Help Only 36 Hours
DeveloPassion's Newsletter #180 - Black Friday Week
Monday, November 25, 2024
Edition 180 of my newsletter, discussing Knowledge Management, Knowledge Work, Zen Productivity, Personal Organization, and more! Sébastien Dubois DeveloPassion's Newsletter DeveloPassion's
Meet HackerNoon's Latest Features: Boost Stories with Translations, Speech-to-Text & More
Monday, November 25, 2024
Hey, Hacker! HackerNoon's monthly product update is here! Get ready for a new version of the mobile app, more translation developments, a new AI Gallery, backend moves, and more! 🚀 This product
The ultimate holiday gadget gift
Monday, November 25, 2024
AI isn't hitting a wall; $70 off Apple Watch; 60+ Amazon deals -- ZDNET ZDNET Tech Today - US November 25, 2024 Meta Quest 3S Why the Meta Quest 3S is the ultimate 2024 holiday present This $299
Deduplication in Distributed Systems: Myths, Realities, and Practical Solutions
Monday, November 25, 2024
This week, we'll discuss the deduplication strategies. We'll see whether they're useful and consider scenarios where you may need them. We'll also do a reality check with the promises
How to know if your data has been exposed
Monday, November 25, 2024
How do you know if your personal data has been leaked? Imagine getting an instant notification if your SSN, credit card, or password has been exposed on the dark web — so you can take action