Predicting Hard Drive Failure with Machine Learning
#140 — January 25, 2021 | View in browser |
AI Digest
Spread the word, build the community, share the knowledge – invite your friends.
sponsor
2.2M developers learned Python last year. Are you one of them?
Share your views about the most important programming languages, tools, and technologies for machine learning and data science in 2021. Take the Developer Economics survey and receive free resources to plan your next career move, plus a chance to win a new smartphone, gaming laptop, licenses, Amazon vouchers, and more. The survey is open until February.
this week's favorite
Predicting Hard Drive Failure with Machine Learning
We’ve all had a hard drive fail on us, and often it’s as sudden as booting your machine and realizing you can’t access a bunch of your files. It’s not a fun experience. It’s especially not fun when you have an entire data center full of drives that are all important to keeping your business running. What if we could predict when one of those drives would fail, and get ahead of it by preemptively replacing the hardware before the data is lost? This is where the history of predictive drive failure begins.
NLPRule: A library for fast grammatical error correction
NLPRule is a library for rule-based grammatical error correction written in pure Rust with bindings for Python. Rules are sourced from LanguageTool.
How Facebook uses AI to improve photo descriptions for visually impaired people
When Facebook users scroll through their News Feed, they find all kinds of content — articles, friends’ comments, event invitations, and of course, photos. Most people are able to instantly see what’s in these images, whether it’s their new grandchild, a boat on a river, or a grainy picture of a band onstage. But many users who are blind or visually impaired (BVI) can also experience that imagery, provided it’s tagged properly with alternative text (or “alt text”). A screen reader can describe the contents of these images using a synthetic voice and enable people who are BVI to understand images in their Facebook feed.
3 deep learning mysteries: Ensemble, knowledge- and self-distillation
Under now-standard techniques, such as over-parameterization, batch-normalization, and adding residual links, “modern age” neural network training—at least for image classification tasks and many others—is usually quite stable. Using standard neural network architectures and training algorithms (typically SGD with momentum), the learned models perform consistently well, not only in terms of training accuracy but even in test accuracy, regardless of which random initialization or random data order is used during the training.
Datasets should behave like git repositories
Problems emerging from data are common in research as well as in the industry. Those problems are dealt with as part of our project, but we usually don't bother solving them at their origin. We fix the data locally once, and we go on with our project. This is certainly a valid method in some cases, but as we share data more and more between projects, we are finding ourselves repeating the same processes over time and across teams. This issue is particularly true for public datasets shared by many people to train many machine learning models. I will show you how to create, maintain, and contribute to a long-living dataset that will update itself automatically across projects, using git and DVC as versioning systems, and DAGsHub as a host for the datasets.
newsletters
Older messages
We Don't Need Data Scientists, We Need Data Engineers
Sunday, January 17, 2021
And more news, tutorials and articles about AI, machine learning, and data science in this week's issue. #139 — January 18, 2021 View in browser AI Digest Spread the word, build the community,
Which Machine Learning Classifiers are best for small datasets?
Sunday, January 10, 2021
And more news, tutorials and articles about AI, machine learning, and data science in this week's issue. #138 — January 11, 2021 View in browser AI Digest Spread the word, build the community,
RoboLeague: A clone of Rocket League for AI experiments
Sunday, January 3, 2021
And more news, tutorials and articles about AI, machine learning, and data science in this week's issue. #137 — January 04, 2021 View in browser AI Digest Spread the word, build the community,
How NOT to learn Machine Learning
Sunday, December 27, 2020
And more news, tutorials and articles about AI, machine learning, and data science in this week's issue. #136 — December 28, 2020 View in browser AI Digest Spread the word, build the community,
How to manage your data the way you manage your code
Sunday, December 20, 2020
And more news, tutorials and articles about AI, machine learning, and data science in this week's issue. #135 — December 21, 2020 View in browser AI Digest Spread the word, build the community,
You Might Also Like
Tuesday Triage #200 and giveaway
Tuesday, May 14, 2024
Your weekly crème de la crème of the Internet is here! The 200th edition featuring annual subscriptions giveaway, thoughts on nearly four years of ...
🎮 How AI Tools Are Changing Game Development — Grab a Pixel 8a Instead of Waiting for Pixel 9
Tuesday, May 14, 2024
Also: Sharing Your Google Maps Trip Progress, and More! How-To Geek Logo May 14, 2024 Did You Know In a bid to keep the ingredients secret, WD-40 was never patented. 🤖 The New GPT It's Tuesday!
Meta shuts down Workplace
Tuesday, May 14, 2024
Plus: Everything that happened at Google I/O and AWS CEO steps down View this email online in your browser By Christine Hall Tuesday, May 14, 2024 Hello, and welcome back to TechCrunch PM. The team
Flattening Lists of Lists, Python 3.13, Sets, and More
Tuesday, May 14, 2024
Flattening a List of Lists in Python #629 – MAY 14, 2024 VIEW IN BROWSER The PyCoder's Weekly Logo Flattening a List of Lists in Python In this video course, you'll learn how to flatten a list
Daily Coding Problem: Problem #1441 [Easy]
Tuesday, May 14, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. UTF-8 is a character encoding that maps each symbol to one, two, three, or four
Noonification: 3 Quick Ways to Optimize RecyclerView
Tuesday, May 14, 2024
Top Tech Content sent at Noon! Get Algolia: AI Search that understands How are you, @newsletterest1? 🪐 What's happening in tech today, May 14, 2024? The HackerNoon Newsletter brings the HackerNoon
Using 97 fewer cores thanks to PGO
Tuesday, May 14, 2024
Plus an HNSW indexed vector store library, a new Go game hits the Steam store, and is 'ok' ok?. | #507 — May 14, 2024 Unsub | Web Version Together with Stytch logo Go Weekly Reclaiming CPU for
Ranked | The Top 6 Economies by Share of Global GDP (1980-2024) 📈
Tuesday, May 14, 2024
Gain a unique perspective on the world's economic order from this graphic showing percentage share of global GDP over time. View Online | Subscribe Presented by: Data that drives the
Free online event this Thursday: Getting ahead with time series data
Tuesday, May 14, 2024
Free Online Event Do you know how your competitors use time series data to get ahead? Join us on Thursday, May 16 at 10am PT/1pm ET for a free, hour-long online fireside chat called “Unleash the Full
Here's the deal
Tuesday, May 14, 2024
We wanted you to be among the first to know about our plans to relaunch the Gigantic training courses that Product Collective now powers! Here's the deal: From May 20th - May 31st, anybody that