TheSequence - 🗣🗣🗣 No Language Left Behind
📝 Editorial

Natural language understanding (NLU) is the area of deep learning that has seen the most impressive breakthroughs in recent years. However, most of the large-scale NLU models that impressed us are typically optimized for a small set of high-resource languages. NLU models that exhibit remarkable performance in areas such as question answering, text completion and machine translation in languages like English, Spanish or French struggle when applied to the hundreds of dialects that lack large training datasets. The result is growing inequality among the segments of the world population that can benefit from high-quality NLU solutions. This disparity is even more apparent for languages spoken outside Europe and North America.

Extending NLU research to low-resource languages is a known challenge in the space. One of the most impressive achievements of recent years came last week from Meta AI with the release of the No Language Left Behind (NLLB)-200 model. This single neural network can translate text across 200 different languages, achieving state-of-the-art results. To train NLLB-200, Meta AI used a two-step curriculum approach in which knowledge acquired during high-resource language training epochs was applied to low-resource languages. The result was a massive 54-billion-parameter model that had to be trained on Meta’s new Research SuperCluster (RSC) supercomputer.

Together with NLLB-200, Meta AI open-sourced the FLORES-200 dataset for evaluating machine translation models. It is also providing $200,000 in grants to non-profit organizations building applications that use NLLB-200. Altogether, NLLB-200 represents one of the most impressive milestones ever achieved in machine translation for low-resource languages.

🔺🔻 TheSequence Scope – our Sunday edition with the industry’s development overview – is free.
To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻

🗓 Next week in TheSequence Edge:
Edge#207: we summarize our graph neural networks (GNNs) series.
Edge#208: we explore Google Brain’s Minerva, which can solve complex mathematical and scientific problems using step-by-step reasoning.

Now, let’s review the most important developments in the AI industry this week.

🔎 ML Research

Translating Across 200 Languages
Meta AI published a paper detailing a new model that can perform high-quality translations across 200 languages →read more on Meta AI blog

Director – a Hierarchical RL Agent
Google Research published a paper detailing Director, a hierarchical reinforcement learning agent that can learn hierarchical behaviors from raw pixels →read more on Google Research blog

Joint Image-Text Representations
Amazon Research published a paper presenting a model for aligning features in image and text datasets →read more on Amazon Research blog

Disfluency Speech Detection
Google Research published a paper detailing a BERT-like model that can detect disfluency in natural speech →read more on Google Research blog

☝️ We Recommend – Try the Real-Time Database for Continuously Changing Data
You can now enroll in Molecula’s 7-day Cloud trial (without installation or infrastructure management) or install FeatureBase in your own environment to meet your needs (no credit card required) →See which trial experience is right for you

🤖 Cool AI Tech Releases

PyTorch 1.12
A new release of PyTorch is available, with capabilities such as TorchArrow for batch data preprocessing, a functional API for modules, and many others →read more on PyTorch blog

🛠 Real World ML

Anomaly Detection at Walmart
Walmart details the ML architecture used for anomaly detection in its e-commerce infrastructure →read more on the Walmart Tech Labs blog

Uber Spark Architecture
Uber discusses some of the updates for data shuffling in its Spark architecture →read more on Uber Engineering blog

💸 Money in AI
Acquisitions