TheSequence - 🌅 The Era of Foundation Models is Here
Was this email forwarded to you? Sign up here 📝 EditorialThe term ‘foundation models’ is becoming one of the hottest buzzwords in the machine learning (ML) lingo. Researchers from Stanford University originally coined the term to describe models that have been trained in large amounts of unlabeled data and can be fine-tuned to specific domains. Think about fine-tuning GPT-like models for domains such as law or science. Foundation models are shifting the ML development paradigm from creating brand-new models to fine-tuning large pretrained models. The efforts around foundation models are increasing remarkably fast. Stanford University created the Center for Research on Foundation Models (CRFM), a new initiative focused on studying best practices around foundation models. Just this week, Snorkel AI released Data-centric Foundation Model Development, a new series of addition to the Snorkel Flow platform to fine-tune and distill foundation models. Meta AI also unveiled details about MultiRay, their platform for running foundation models at scale. Finally, the CRFM team unveiled a new benchmark to facilitate the holistic evaluation of foundation models. Foundation models efforts are popping up everywhere, from large AI labs to innovative startups. Building by fine-tuning the new paradigm. The era of foundation models is definitely upon us! 🔺🔻TheSequence Scope – our Sunday edition with the industry’s development overview – is free. To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻 🗓 Next week in TheSequence Edge: Edge#245: we start a new series about machine learning interpretability; discuss Manifold, an architecture for debugging ML models; explore Meta’s Captum, a framework for deep learning interpretability. Edge#246: we discuss OpenAI’s best practices that they used to mitigate risks while training DALL-E2 📌 Our LinkedIn accountIn this uncertain times for Twitter, we’d like to introduce to you TheSequence’s LinkedIn account. We are building a unique resource and support system for all ML&AI aficionados. Let’s connect! Now, let’s review the most important developments in the AI industry this week 🔎 ML ResearchHELM Stanford University published HELM, a benchmark for the holistic evaluation of foundation models →read more MultiRay Meta AI discusses MultiRay, the architecture used to power large foundation ML models at scale across their different organizations →read more Data Enrichment Practices DeepMind published an insightful paper discussing human data collection best practices used in real-world ML scenarios →read more MoE with Expert Routing Google Research published a research paper proposing a routing algorithm in mixture of experts (MoE) neural networks →read more 🤖 Cool AI Tech ReleasesData-centric Foundation Model Development Snorkel AI released Data-centric Foundation Model Development, a new set of capabilities in the Snorkel Flow platform to adapt large foundation models to domain-specific scenarios →read more Data Cards Playbook Google Brain released Data Cards Playbook, a toolkit for transparency in ML datasets →read more 🛠 Real World MLAnomaly Detection in Prime Video Amazon Science discusses the ML techniques used for anomaly detection in their Prime Video application →read more Einstein Search Answers Salesforce Research discusses the ML techniques powering Einstein Search Answers, a new search architecture for customer support →read more Netflix Video Quality Netflix discusses the neural network techniques used for video encoding optimizations in the media giant →read more 💸 Money in AIML&AI&Data
AI-powered
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
📝 Guest post: Using One Methodology to Solve The Three Failure Modes
Friday, November 18, 2022
In this guest post, Eric Landau, CEO of Encord, discusses the three major model failure modes that prevent models from reaching the production state and solve all three problems with a single
🗣👥 Edge#244: This Google Model Combines Reasoning and Acting in a Single Language Model
Thursday, November 17, 2022
ReAct provides an architecture that triggers actions based on language reasoning paths
📌 Event: apply(recsys)—ML experts from Slack, ByteDance & more share their recommender system learnings
Wednesday, November 16, 2022
Are you building an ML recommender system or planning to? Then you won't want to miss apply(recsys)
🔂 Edge#243: Text-to-Image Synthesis Models – Recap
Tuesday, November 15, 2022
Our longest and the most popular series
☝️CoreWeave to Offer NVIDIA HGX H100 Supercomputers - Supporting Cutting Edge AI & ML Companies*
Monday, November 14, 2022
CoreWeave is proud to be among the first providers to offer cloud instances with NVIDIA HGX H100 supercomputers. NVIDIA's HGX H100 platform represents a major leap forward for the AI community,
You Might Also Like
🎉 Black Friday Early Access: 50% OFF
Monday, November 25, 2024
Black Friday discount is now live! Do you want to master Clean Architecture? Only this week, access the 50% Black Friday discount. Here's what's inside: 7+ hours of lessons .NET Aspire coming
Open Pull Request #59
Monday, November 25, 2024
LightRAG, anything-llm, llm, transformers.js and an Intro to monads for software devs ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Last chance to register: SecOps made smarter
Monday, November 25, 2024
Don't miss this opportunity to learn how gen AI can transform your security workflowsㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ elastic | Search. Observe. Protect
SRE Weekly Issue #452
Monday, November 25, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
Corporate Casserole 🥘
Monday, November 25, 2024
How marketing and lobbying inspired Thanksgiving traditions. Here's a version for your browser. Hunting for the end of the long tail • November 24, 2024 Hey all, Ernie here with a classic
WP Weekly 221 - Bluesky - WP Assets on CDN, Limit Font Subsets, ACF Pro Now
Monday, November 25, 2024
Read on Website WP Weekly 221 / Bluesky Have you joined Bluesky, like many other WordPress users, a new place for an online social presence? Also in this issue: CrawlWP, Asset Management Framework,
🤳🏻 We Need More High-End Small Phones — Linux Terminal Setup Tips
Sunday, November 24, 2024
Also: Why I Switched From Google Maps to Apple Maps, and More! How-To Geek Logo November 24, 2024 Did You Know Medieval moats didn't just protect castles from invaders approaching over land, but
JSK Daily for Nov 24, 2024
Sunday, November 24, 2024
JSK Daily for Nov 24, 2024 View this email in your browser A community curated daily e-mail of JavaScript news JavaScript Certification Black Friday Offer – Up to 54% Off! Certificates.dev, the trusted
OpenAI's turbulent early years - Sync #494
Sunday, November 24, 2024
Plus: Anthropic and xAI raise billions of dollars; can a fluffy robot replace a living pet; Chinese reasoning model DeepSeek R1; robot-dog runs full marathon; a $12000 surgery to change eye colour ͏ ͏
Daily Coding Problem: Problem #1618 [Easy]
Sunday, November 24, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Zillow. Let's define a "sevenish" number to be one which is either a power