TheSequence - 📽🎥 Meta AI’s Make-A-Video
Was this email forwarded to you? Sign up here 📝 EditorialGenerative models based on textual inputs are experiencing tremendous momentum. Models such as DALL-E, Midjourney, and Stable Diffusion have captured the imagination of not only the AI community but artists, designers, gamers, and creative minds across many different domains. When thinking about the next milestone for text-to-image synthesis models, video creation is often cited on the top of the list. Obviously, video generation presents significant challenges compared to static images. For starters, video requires significantly more training resources, and there are very few high-quality datasets available that works with supervised methods. Also, the feature representation space of videos is considerably more complex than images. Just like text-to-image, recently text-to-video has turned to unsupervised pretrained methods. A few days ago, Meta AI took a very important step in advancing text-to-video synthesis with the unveiling of Make-A-Video, a model able to generate high-quality videos from textual inputs. Make-A-Video follows the announcement of Make-A-Scene, a photorealistic text-to-image synthesis model. Make-A-Video learns the correspondence between text, visual, and movement from unsupervised video data. Arguably, the biggest contribution of Make-A-Video is the fact that the model doesn’t require text-video pairs for its training. Just from processing large amounts of video content, Make-A-Video is able to infer how different objects move and interact. Part of this innovation comes from leveraging text-image priors. Meta AI didn’t release a version of Make-A-Video as its still understanding the ethical concerns around these type of models, but the website indicates that a limited release might be available soon. Make-A-Video is an indication that a new wave of text-to-video synthesis models is around the corner. 🔺🔻TheSequence Scope – our Sunday edition with the industry’s development overview – is free. To receive high-quality content about the most relevant developments in the ML world every Tuesday and Thursday, please subscribe to TheSequence Edge 🔺🔻 🗓 Next week in TheSequence Edge: Edge#231: we explore Text-to-image synthesis with GANs; discuss Google’s XMC-GAN, a modern approach to text-to-image synthesis; explore NVIDIA GauGAN2 Demo. Edge#232: we deep dive into DeepMind’s new method for discovering when an agent is present in a system 📌 Feature Store Summit 2022: A free conference on Feature EngineeringWe highly recommend the second Feature Store Summit, a free online conference for Feature Engineering and managing data for AI organized by Hopsworks. Join them October 11th, 2022! This year's talks and sessions revolve around the theme of 'Accelerating Production Machine Learning with Feature Stores' from companies such as Uber, Linkedin, Airbnb, Doordash, Disney Streaming and many more. By joining the event you will be able to hear from people who have seen the good, bad and ugly side of the feature stores and learn from their experiences. It will help you to understand the capabilities of a Feature Store and the various cutting-edge technologies that facilitate bringing ML models into production, as well as showcase ways to improve your ML platforms. Now, let’s review the most important developments in the AI industry this week 🔎 ML ResearchMake-A-Video Meta AI published a paper detailing Make-A-Video, a text-to-video synthesis model that can produce short, high-quality video clips from textual inputs →read more Fast and Sustainable Reinforcement Learning Google Research published a paper unveiling ActorQ, a method for accelerating the training and efficiency of RL agents →read more Alexa’s Interactive Story Creation Amazon Science published a detailed article about the ML techniques powering Alexa’s new interactive story creation features →read more AI Systems Performance Evaluation Microsoft Research published an article detailing the techniques and best practices used to evaluate the performance of the AI systems powering PeopleLens, a solution behind social interaction for blind children →read more 🤖 Cool AI Tech ReleasesBigCode Hugging Face and ServiceNow Research partnered to launch BigCode, a project that aims to build large language models for coding →read more SetFit Hugging Face and Intel Labs open-sourced SetFit, a framework for few-shot fine-tuning of Sentence Transformers →read more PySyTFF TensorFlow and OpenMined collaborated to launch PySyTFF, a new framework for privacy-preserving ML →read more Venice LinkedIn open sourced Venice, a derived data platform for high-throughput, low latency datasets →read more Freely Available DALL-E OpenAI enabled access to DALL-E without a waitlist →read more 🛠 Real World MLTrillion Parameter Scalability at AWS Amazon Science discusses the techniques and architecture used to scale the training of large ML models to over one trillion parameters →read more Data Warehousing at Airbnb Airbnb discusses the architecture and processes used to upgrade their data warehousing infrastructure →read more 💸 Money in AIML&AI
AI-powered
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
📌Join industry leaders at the ML:Integrity conference / Oct 19
Friday, September 30, 2022
ML:Integrity is a free, virtual conference dedicated to advancing machine learning integrity. Join dozens of leading executives, policy makers, and academics for a day of talks on Wednesday, October 19
1️⃣0️⃣0️⃣0️⃣ Edge#230: How Amazon Scaled Alexa to 1000 Languages
Thursday, September 29, 2022
Self-Supervised pretraining, transfer learning and knowledge distillation were among the techniques used to scale Alexa across many languages
🍂 It's FALL: Subscribe for ONLY $35/year
Wednesday, September 28, 2022
For a limited time, we offer 30% OFF on our Premium content that helps you be on top of the AI knowledge, up-to-date with all the ML developments, and be smarter than your colleagues ;) With your help,
🌅 Edge#229: VQGAN + CLIP
Tuesday, September 27, 2022
+the original VQGAN+CLIP paper; +VQGAN+CLIP implementations
📝 Guest post: 4 Types of ML Data Errors You Can Fix Right Now*
Monday, September 26, 2022
In this article, Galileo founding engineer Nikita Demir discusses common data errors that NLP teams run into and how Galileo helps fix these errors in minutes with a few lines of code. A very helpful
You Might Also Like
Is there more to your iPhone?
Monday, November 25, 2024
Have you ever wondered if there's more to your iPhone than meets the eye? Maybe you've been using it for years, but certain powerful features and settings remain hidden. That's why we'
🎉 Black Friday Early Access: 50% OFF
Monday, November 25, 2024
Black Friday discount is now live! Do you want to master Clean Architecture? Only this week, access the 50% Black Friday discount. Here's what's inside: 7+ hours of lessons .NET Aspire coming
Open Pull Request #59
Monday, November 25, 2024
LightRAG, anything-llm, llm, transformers.js and an Intro to monads for software devs ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Last chance to register: SecOps made smarter
Monday, November 25, 2024
Don't miss this opportunity to learn how gen AI can transform your security workflowsㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ elastic | Search. Observe. Protect
SRE Weekly Issue #452
Monday, November 25, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
Corporate Casserole 🥘
Monday, November 25, 2024
How marketing and lobbying inspired Thanksgiving traditions. Here's a version for your browser. Hunting for the end of the long tail • November 24, 2024 Hey all, Ernie here with a classic
WP Weekly 221 - Bluesky - WP Assets on CDN, Limit Font Subsets, ACF Pro Now
Monday, November 25, 2024
Read on Website WP Weekly 221 / Bluesky Have you joined Bluesky, like many other WordPress users, a new place for an online social presence? Also in this issue: CrawlWP, Asset Management Framework,
🤳🏻 We Need More High-End Small Phones — Linux Terminal Setup Tips
Sunday, November 24, 2024
Also: Why I Switched From Google Maps to Apple Maps, and More! How-To Geek Logo November 24, 2024 Did You Know Medieval moats didn't just protect castles from invaders approaching over land, but
JSK Daily for Nov 24, 2024
Sunday, November 24, 2024
JSK Daily for Nov 24, 2024 View this email in your browser A community curated daily e-mail of JavaScript news JavaScript Certification Black Friday Offer – Up to 54% Off! Certificates.dev, the trusted
OpenAI's turbulent early years - Sync #494
Sunday, November 24, 2024
Plus: Anthropic and xAI raise billions of dollars; can a fluffy robot replace a living pet; Chinese reasoning model DeepSeek R1; robot-dog runs full marathon; a $12000 surgery to change eye colour ͏ ͏