TheSequence - 🗣🗣🗣 No Language Left Behind
📝 Editorial

Natural language understanding (NLU) is the area of deep learning that has seen the most impressive breakthroughs in recent years. However, most of the large-scale NLU models behind those breakthroughs are typically optimized for a small set of high-resource languages. Models that perform remarkably well at question answering, text completion and machine translation in languages like English, Spanish or French struggle when applied to hundreds of dialects that lack large training datasets. The result is growing inequality among the segments of the world population that can benefit from high-quality NLU solutions. This disparity is even more apparent for languages spoken outside Europe and North America.

Extending NLU research to low-resource languages is a known challenge in the space. One of the most impressive achievements of recent years came last week from Meta AI with the release of the No Language Left Behind (NLLB)-200 model. This single neural network can translate text across 200 languages with state-of-the-art results. To train NLLB-200, Meta AI used a two-step curriculum approach in which knowledge acquired during training epochs on high-resource languages was transferred to low-resource languages. The result was a massive 54-billion-parameter model that had to be trained on Meta’s new Research SuperCluster (RSC) supercomputer.

Together with NLLB-200, Meta AI open-sourced the FLORES-200 dataset for evaluating machine translation models. It is also providing $200,000 in grants to non-profit organizations building applications that use NLLB-200. Altogether, NLLB-200 represents one of the most impressive milestones ever achieved in machine translation for low-resource languages.

🔺🔻 TheSequence Scope, our Sunday edition with an overview of the industry’s developments, is free. To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻

🗓 Next week in TheSequence Edge:

Edge#207: we summarize our graph neural networks (GNNs) series.

Edge#208: we explore Google Brain’s Minerva, which can solve complex mathematical and scientific problems using step-by-step reasoning.

Now, let’s review the most important developments in the AI industry this week.

🔎 ML Research

Translating Across 200 Languages
Meta AI published a paper detailing a new model that can perform high-quality translations across 200 languages → read more on Meta AI blog

Director: a Hierarchical RL Agent
Google Research published a paper detailing Director, a hierarchical reinforcement learning agent that can learn hierarchical behaviors from raw pixels → read more on Google Research blog

Joint Image-Text Representations
Amazon Research published a paper presenting a model for aligning features in image and text datasets → read more on Amazon Research blog

Disfluency Speech Detection
Google Research published a paper detailing a BERT-like model that can detect disfluency in natural speech → read more on Google Research blog

☝️ We Recommend: Try the Real-Time Database for Continuously Changing Data

You can now enroll in Molecula’s 7-day Cloud trial (without installation or infrastructure management) or install FeatureBase in your own environment to meet your needs (no credit card required) → See which trial experience is right for you

🤖 Cool AI Tech Releases

PyTorch 1.12
A new release of PyTorch is available with capabilities such as TorchArrow for batch data preprocessing, a functional API for modules and many others → read more on PyTorch blog

🛠 Real World ML

Anomaly Detection at Walmart
Walmart details the ML architecture used for anomaly detection in its e-commerce infrastructure → read more on the Walmart Tech Labs blog

Uber Spark Architecture
Uber discusses some of the updates for data shuffling in its Spark architecture → read more on Uber Engineering blog

💸 Money in AI
Acquisitions