I Promise, this Editorial is NOT About OpenAI
Was this email forwarded to you? Sign up here I Promise, this Editorial is NOT About OpenAISome major milestones in generative video were announced this week.Next Week in The Sequence:
You can subscribe below:📝 Editorial: I Promise, this Editorial is NOT About OpenAII intentionally plan to avoid discussing the recent events at OpenAI in this weekend's editorial. There are plenty of other AI newsletters on the planet offering various opinionated takes, even without all the facts. I prefer to wait until more information is available before forming an opinion. Here are just a couple of key points:
Before the drama unfolded Friday, I had written today's editorial about the progress in generative video. Let's keep it brief 😉. Long considered one of the most challenging areas for generative AI, video creation is quickly becoming a new frontier in the field. Generative video models must integrate concepts such as movement, physical reactions, time alignment, and interactions between objects, which are not required in traditional image scenarios. Additionally, the number of video datasets is relatively small compared to those for text, images, or audio. Not surprisingly, the video space has lagged behind other generative AI domains. But this is rapidly changing. The volume and quality of research in generative video are swiftly increasing. Just this week, Meta and Google published new work in this area. Meta AI unveiled their advancements in Emu Video and Emu Edit, marking significant milestones in generative video. Emu Video is a high-quality text-to-video model that generates images from a text prompt and then short videos based on both the text and the images. Emu Edit is an image editing model capable of transforming images based on textual instructions, suitable for both global and local edits. Also this week, Google Research released a paper on Mirasol3B, a model for the multimodal understanding of long-form videos. Mirasol3B consists of two autoregressive models that infer information from different modalities such as video, audio, or text present in long-form videos. Initial results show Mirasol3B achieving new milestones in video question-answering benchmarks. Video is emerging as one of the new frontiers in generative AI. Ironically, this is an area where OpenAI has not particularly excelled. 🔎 ML ResearchEmu Video and Emu EditMeta AI published papers outlining Emu Video and Emu Edit which represents their latest research in generative video generation and edition respectively. Both models are based on Emu, Meta AI’s first image generation model —> Read more. Long Video UnderstandingGoogle Research published a paper proposing Mirasol3B, a multimodal model that can learn long forms of text, audio and video. The main innovation of Mirasol3B is that it decouples the learning into different autoregressive models which allow higher levels of specialization —> Read more. Optimizing Models for Different HardwareAmazon Science published a detailed analysis of the techniques used to optimize neural architecture search(NAS) models across different hardware. The process includes aspects such as curating the search space and incorporating human feedback —> Read more. Weather ForecastingGoogle DeepMind published a paper detailing GraphCast, a weather forecasting model. GraphCast is able to predict weather conditions up to 10 days in advance beating the state-of-the-art models in both accuracy and cost —> Read more. GhostbusterBerkeley AI Research(BAIR) published a paper proposing Ghostbuster, a techique for detecting AI generated content. Ghostbuster uses LLMs to determine the probability of generating each token in a document and then combines those results in a final classifier —> Read more. 🤖 Cool AI Tech ReleasesLyriaGoogle DeepMind and Youtube collaborated on building Lyria, an advanced music generation model as well as a set of music AI tools —> Read more. Microsoft AI ReleasesMicrosoft announced numerous AI releases at its Ignite conference —> Read more. LlamaIndex 0.9The new release of LlamaIndex is here with quite a group of new feature —> Read more. NVIDIA AI Foundry ServiceNVIDIA announced the release of the Foundry family of foundation models in partnership with Microsoft Azure —> Read more. 🛠 Real World MLGetting Started with Llama 2Meta AI published an step by step process to get started with Llama 2 —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
😎 Private Preview: Build Real-Time AI Applications Using Only Python
Friday, November 17, 2023
Our friends from Tecton launched a new, AI-optimized, Python-based compute engine called Rift. Now you can build real-time AI applications in minutes! Using Tecton with Rift, you can: Build better
Edge 344: LLMs and Memory is All You Need. Inside One of the Most Shocking Papers of the Year
Friday, November 17, 2023
Can memory-augmented LLMs simulate any algorithm?
Edge 343: Understanding Llama-Adapter Fine-Tuning
Tuesday, November 14, 2023
One of the most intriguing fine-tuning methods that combines prefix-tuning and PEFT.
📝 Guest Post: Comparing Vector Databases, Vector Search Libraries, and Vector Search Plugins*
Monday, November 13, 2023
In this guest post, Frank Liu, Head of ML&AI at Zilliz, explores the intricate realm of vector search, comparing vector databases, vector search plugins, and vector search libraries. Let's dive
OpenAI is Starting to Look Like Apple in 2008
Sunday, November 12, 2023
Some non obvious views on the most important announcement of the week.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your