TheSequence - 🌅 The Era of Foundation Models is Here
Was this email forwarded to you? Sign up here 📝 EditorialThe term ‘foundation models’ is becoming one of the hottest buzzwords in the machine learning (ML) lingo. Researchers from Stanford University originally coined the term to describe models that have been trained in large amounts of unlabeled data and can be fine-tuned to specific domains. Think about fine-tuning GPT-like models for domains such as law or science. Foundation models are shifting the ML development paradigm from creating brand-new models to fine-tuning large pretrained models. The efforts around foundation models are increasing remarkably fast. Stanford University created the Center for Research on Foundation Models (CRFM), a new initiative focused on studying best practices around foundation models. Just this week, Snorkel AI released Data-centric Foundation Model Development, a new series of addition to the Snorkel Flow platform to fine-tune and distill foundation models. Meta AI also unveiled details about MultiRay, their platform for running foundation models at scale. Finally, the CRFM team unveiled a new benchmark to facilitate the holistic evaluation of foundation models. Foundation models efforts are popping up everywhere, from large AI labs to innovative startups. Building by fine-tuning the new paradigm. The era of foundation models is definitely upon us! 🔺🔻TheSequence Scope – our Sunday edition with the industry’s development overview – is free. To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻 🗓 Next week in TheSequence Edge: Edge#245: we start a new series about machine learning interpretability; discuss Manifold, an architecture for debugging ML models; explore Meta’s Captum, a framework for deep learning interpretability. Edge#246: we discuss OpenAI’s best practices that they used to mitigate risks while training DALL-E2 📌 Our LinkedIn accountIn this uncertain times for Twitter, we’d like to introduce to you TheSequence’s LinkedIn account. We are building a unique resource and support system for all ML&AI aficionados. Let’s connect! Now, let’s review the most important developments in the AI industry this week 🔎 ML ResearchHELM Stanford University published HELM, a benchmark for the holistic evaluation of foundation models →read more MultiRay Meta AI discusses MultiRay, the architecture used to power large foundation ML models at scale across their different organizations →read more Data Enrichment Practices DeepMind published an insightful paper discussing human data collection best practices used in real-world ML scenarios →read more MoE with Expert Routing Google Research published a research paper proposing a routing algorithm in mixture of experts (MoE) neural networks →read more 🤖 Cool AI Tech ReleasesData-centric Foundation Model Development Snorkel AI released Data-centric Foundation Model Development, a new set of capabilities in the Snorkel Flow platform to adapt large foundation models to domain-specific scenarios →read more Data Cards Playbook Google Brain released Data Cards Playbook, a toolkit for transparency in ML datasets →read more 🛠 Real World MLAnomaly Detection in Prime Video Amazon Science discusses the ML techniques used for anomaly detection in their Prime Video application →read more Einstein Search Answers Salesforce Research discusses the ML techniques powering Einstein Search Answers, a new search architecture for customer support →read more Netflix Video Quality Netflix discusses the neural network techniques used for video encoding optimizations in the media giant →read more 💸 Money in AIML&AI&Data
AI-powered
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Key phrases
Older messages
📝 Guest post: Using One Methodology to Solve The Three Failure Modes
Friday, November 18, 2022
In this guest post, Eric Landau, CEO of Encord, discusses the three major model failure modes that prevent models from reaching the production state and solve all three problems with a single
🗣👥 Edge#244: This Google Model Combines Reasoning and Acting in a Single Language Model
Thursday, November 17, 2022
ReAct provides an architecture that triggers actions based on language reasoning paths
📌 Event: apply(recsys)—ML experts from Slack, ByteDance & more share their recommender system learnings
Wednesday, November 16, 2022
Are you building an ML recommender system or planning to? Then you won't want to miss apply(recsys)
🔂 Edge#243: Text-to-Image Synthesis Models – Recap
Tuesday, November 15, 2022
Our longest and the most popular series
☝️CoreWeave to Offer NVIDIA HGX H100 Supercomputers - Supporting Cutting Edge AI & ML Companies*
Monday, November 14, 2022
CoreWeave is proud to be among the first providers to offer cloud instances with NVIDIA HGX H100 supercomputers. NVIDIA's HGX H100 platform represents a major leap forward for the AI community,
You Might Also Like
AI search engine startup Perplexity eyes a $3B valuation
Tuesday, April 23, 2024
Plus: It's Tesla earnings day and AWS wants to host your AI models View this email online in your browser By Cody Corrall Tuesday, April 23, 2024 Welcome back to TechCrunch PM. Today we have big
🎞️ We Tried 3D Printing a Photo — You'll Love This Secret Samsung Galaxy Bluetooth Feature
Tuesday, April 23, 2024
Also: Transferring Your Phone Number to a New Carrier, and More! How-To Geek Logo April 23, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to
You're invited – product sense, prioritization, careers
Tuesday, April 23, 2024
Product Sense Product Sense Wednesday, May 1st @ 01:00 PM EST Learn how to identify opportunities, assess risks, and make informed decisions that lead to successful product innovations by better
CTRL-C, Exceptions, Ruff Speed-up, and More
Tuesday, April 23, 2024
Asyncio Handle Control-C (SIGINT) #626 – APRIL 23, 2024 VIEW IN BROWSER The PyCoder's Weekly Logo Asyncio Handle Control-C (SIGINT) When the user presses CTRL-C on the keyboard, the OS raises an
Writing Contests Just Landed On Product Hunt 🔥
Tuesday, April 23, 2024
Upvote us to keep the $$$ coming! 👍 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Daily Coding Problem: Problem #1421 [Hard]
Tuesday, April 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Uber. Given an array of integers, return a new array such that each element at index i
Ranked | The Top 10 EV Battery Manufacturers 🔋
Tuesday, April 23, 2024
Asia dominates this ranking of the world's largest EV battery manufacturers in 2023. See which battery makers feature in the top 10. View Online | Subscribe Presented by: EnergyX's
Bringing PGO to the build pipeline
Tuesday, April 23, 2024
Plus how Go grew at Google, cmp.Or, and ways to visualize makefiles, Go binaries, and live Go processes. | #504 — April 23, 2024 Unsub | Web Version Together with Three Dots Labs Go Weekly How Dolt
Noonification: Leetcode: Two-sum an Intuitive Approach
Tuesday, April 23, 2024
Top Tech Content sent at Noon! Get Algolia: AI Search that understands How are you, @newsletterest1? 🪐 What's happening in tech this week: The Noonification by HackerNoon has got you covered with
The best AI chatbot for coding
Tuesday, April 23, 2024
9 video gadget must-haves; 6 things Linux should borrow from MacOS -- ZDNET ZDNET Tech Today - US April 23, 2024 placeholder Can Meta AI code? I tested it against Llama, Gemini and ChatGPT - it wasn