🗄 A Model Compression Library You Need to Know About
Weekly news digest curated by the industry insiders

📝 Editorial

The machine learning (ML) space is currently dominated by large models whose computational requirements are often out of reach for most organizations. Model compression is one of the disciplines targeting that challenge by creating smaller models without sacrificing accuracy. Despite the obvious need, model compression remains a challenge for ML engineering teams, as most frameworks in the space are relatively nascent. As a result, you rarely hear about ML engineering pipelines that incorporate model compression as a native building block. Quite the opposite: model compression tends to be one of those things that you only consider once the problem is too big to ignore; literally 😉

Last week, Microsoft Research open-sourced a new framework that attempts to streamline compression in deep learning models. DeepSpeed Compression is part of the DeepSpeed platform, which aims to address the challenges of large-scale AI systems. The framework provides a catalog of common model compression techniques abstracted behind a consistent programming model. The initial experiments showed up to 32x compression rates in large transformer architectures such as BERT. If DeepSpeed Compression follows the path of other frameworks in the DeepSpeed family, it could be productized as part of the Azure ML platform and streamline the adoption of compression methods in deep learning architectures. DeepSpeed Compression is definitely a framework for the ML engineering community to follow.

🔺🔻 TheSequence Scope, our Sunday edition with the industry’s development overview, is free.
To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻

🗓 Next week in TheSequence Edge:

Edge#211: we discuss what to test in ML models; explain how Meta uses A/B testing to improve Facebook’s newsfeed algorithm; explore Meta’s Ax, a framework for A/B testing in PyTorch.

Edge#212: we dive deep inside the Masterful CLI Trainer, a low-code CV model development platform.

Now, let’s review the most important developments in the AI industry this week.

🔎 ML Research

Generalist Reinforcement Learning Agents
Google Research published a paper unveiling a generalist reinforcement learning agent that can play many video games simultaneously →read more on Google Research blog

Outlier Root Cause Analysis
Amazon Research published a paper outlining a technique to detect the root causes of statistical outliers →read more on Amazon Research blog

CodeRL
Salesforce Research published a paper and open-sourced code for CodeRL, a reinforcement learning framework for program synthesis →read more on Salesforce Research blog

The Algorithms Behind Transformers
DeepMind published a research paper detailing the algorithms and mathematical foundations of transformer architectures →read more in the original research paper from DeepMind

☝️ We Recommend

Join this webinar and discover the Hopsworks 3.0 release! In this talk, Hopsworks' VP of engineering will explore new capabilities in Hopsworks feature store 3.0 and how it can help data scientists who love Python to manage their features for training and serving models. He will also cover native Python support for feature engineering, feature pipelines, feature views that represent models in the feature store, transformation functions, and data validation with Great Expectations. Join us on Aug 3 at 7 PM CEST.
🤖 Cool AI Tech Releases

DeepSpeed Compression
Microsoft Research open-sourced DeepSpeed Compression, a framework for compression and system optimization in deep learning models →read more on Microsoft Research blog

DALL-E Beta
OpenAI expanded the availability of DALL-E to over a million people on the waitlist →read more on OpenAI blog

New Tools and Frameworks for Alexa
Amazon unveiled a series of new developer frameworks and tools for Alexa that improve developers’ and device makers’ experience →read more on Amazon Developer blog

PlayTorch App
PyTorch open-sourced the PlayTorch app to streamline the development of mobile AI experiences →read more on PyTorch blog

🛠 Real World ML

Out of Memory Predictions at Netflix
Netflix discusses the architecture powering ML models used to predict memory capacity errors in TVs and set-top boxes →read more on Netflix tech blog

💸 Money in AI