Beyond OpenAI: Apple’s On-Device AI Strategy
Was this email forwarded to you? Sign up here Beyond OpenAI: Apple’s On-Device AI StrategyPlus a new super coder model, Meta’s new AI releases, DeepMind’s video-to-audio models and much more.Next Week in The Sequence:
You can subscribe to The Sequence below:📝 Editorial: Beyond OpenAI: Apple’s On-Device AI StrategyThe partnership between Apple and OpenAI dominated the headlines of the recent WWDC conference and sparked passionate debates within the AI community. Some view this partnership as a way to enable best-in-class AI in iOS devices, while others consider it a consequence of Apple’s lack of readiness to build proprietary AI models. The latter assumption would be a mistake. While it is true that Apple hasn’t historically developed the same AI research talent as Microsoft, Google, or Meta, things are rapidly changing, and last week was a validation of that. A few days ago, Apple open-sourced 20 small models and 4 datasets across different language and image categories. The models are compatible with Apple’s CoreML framework, designed to run on-device models. In the new release, we can find models such as FastViT for image classification, DepthAnything for monocular depth estimation, and DETR for semantic segmentation. These models can run on-device without the need for an Internet connection. The demand for smaller foundation models that can run on edge devices continues to grow. Several factors are contributing to this. First, mobile and IoT devices represent a significant percentage of user-computer interactions and, as a result, a fertile ground for AI. Computational constraints, personalization requirements, and privacy and security challenges prevent these scenarios from being solved by mega-large foundation models. Apple has one of the largest distribution channels for on-device models and, consequently, it shouldn’t be a surprise that it is advancing research in that area. Thinking that Apple’s AI strategy is dependent on the partnership with OpenAI would be a mistake. On-device AI is going to be a relevant trend, and Apple one of its main influencers. 🔎 ML ResearchDeepMind V2AGoogle DeepMind published the research related to their video-to-audio(V2A) models. V2A combines video pixels and prompts to generate rich soundtracks that complement the video clips —> Read more. DeepSeek-Coder-V2DeepSeek published the research behind DeepSeek-Coder-v2, a mixture-of-experts(MoE) architecture optimized for coding and math reasoning. DeepSeek-Coder-v2 supports 338 programming languages and shows performance comparable to GPT-4o in coding tasks —> Read more. Vulnerabilities in Multimodal AgentsResearchers from Carnegie Mellon University(CMU) published a paper outlining a series of adversarial attacks in vision-language agents. The research use adversarial prompts to perturb the model gradients and guide it to take the wrong actions —> Read more. DataCompResearchers from several research labs including University of Washington and Apple published a paper unveiling DataComp for Language Models (DCLM), a method for creating high quality training datasets for foundation models. DCLM also introduces a corpus of 240T tokens extracted from Common Crawl and 53 evaluation recipes —> Read more. Task-Me-AnythingResearchers from the University of Washington and Allen AI published a paper outlining Task-Me-Anything, a technique that is able to generate a benchmark tailored to user’s needs.. The method is optimized for multimodal models and maintains a library of assets across different mediums( videos, images, 3Ds) and combines them to generate new benchmarks —> Read more. Whiteboard-of-ThoughtResearchers from Columbia University published a paper introducing Whiteboard-of-Thought a reasoning method for multimodal models. The core idea is to prompt models with a metaphorical whiteboard that illustrates reasoning steps visually —> Read more. 🤖 Cool AI Tech ReleasesMeta New ModelsMeta released new models for audio, watermarking, multi-token prediction, images and others —> Read more. Apple New ModelsApple released a series of small models for language and image capabilities —> Read more. Claude 3.5 SonnetAnthropic released Claude 3.5 Sonnet which exhibit strong performance at much faster speed —> Read more. Gen-3 AlphaRunway unveiled Gen-3 Alpha, its new video generation model with tangible fidelity and consistency improvements over its predecessors —> Read more. AutoGen StudioMicrosoft released AutoGen Studio, a low-code interface for building and testing multi-agent solutions —> Read more. BigCodeBenchHuggingFace released BigCodeBench, a new benchmark specialized in code-generation tasks —> Read more. 🛠 Real World AIModel Training at MetaMeta shares some details about the infrastructure used to scale the training of foundation models internally —> Read more. Video Classification at NetflixNetflix discusses their use of vision-language models and active learning to build video classifiers —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Edge 404: Inside Anthropic's Dictionary Learning, A Breakthrough in LLM Interpretability
Thursday, June 20, 2024
Arguably one of the most important papers of 2024 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Sequence Chat: Justin D. Harris - About Building Microsoft CoPilot
Wednesday, June 19, 2024
Quick bio This is your second interview at The Sequence. Please tell us a bit about yourself. Your background, current role and how did you get started in AI? I grew up in the suburbs of Montreal and I
Edge 405: Memory and Autonomous Agents
Tuesday, June 18, 2024
Augmenting autonomous agents capabilities with different memory architectures can lead to amazing capabilities. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
📽 [Virtual talk] Build hyper-personalized product experiences with Full RAG
Monday, June 17, 2024
Hey there, Want to build highly personalized product experiences? Building them with traditional RAG (Retrieval-Augmented Generation) alone is tough, especially when it comes to adding real-time and
Amazing Dream Machine
Sunday, June 16, 2024
A text-to-video model freely available to everyone. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
You Might Also Like
WebAIM November 2024 Newsletter
Friday, November 22, 2024
WebAIM November 2024 Newsletter Read this newsletter online at https://webaim.org/newsletter/2024/november Features Using Severity Ratings to Prioritize Web Accessibility Remediation When it comes to
➡️ Why Your Phone Doesn't Want You to Sideload Apps — Setting the Default Gateway in Linux
Friday, November 22, 2024
Also: Hey Apple, It's Time to Upgrade the Macs Storage, and More! How-To Geek Logo November 22, 2024 Did You Know Fantasy author JRR Tolkien is credited with inventing the main concept of orcs and
JSK Daily for Nov 22, 2024
Friday, November 22, 2024
JSK Daily for Nov 22, 2024 View this email in your browser A community curated daily e-mail of JavaScript news React E-Commerce App for Digital Products: Part 4 (Creating the Home Page) This component
Spyglass Dispatch: The Fate of Chrome • Amazon Tops Up Anthropic • Pros Quit Xitter • Brave Powers AI Search • Apple's Lazy AI River • RIP Enrique Allen
Friday, November 22, 2024
The Fate of Chrome • Amazon Tops Up Anthropic • Pros Quit Xitter • Brave Powers AI Search • Apple's Lazy AI River • RIP Enrique Allen The Spyglass Dispatch is a free newsletter sent out daily on
Charted | How the Global Distribution of Wealth Has Changed (2000-2023) 💰
Friday, November 22, 2024
This graphic illustrates the shifts in global wealth distribution between 2000 and 2023. View Online | Subscribe | Download Our App Presented by: MSCI >> Get the Free Investor Guide Now FEATURED
Daily Coding Problem: Problem #1616 [Easy]
Friday, November 22, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Alibaba. Given an even number (greater than 2), return two prime numbers whose sum will
The problem to solve
Friday, November 22, 2024
Use problem framing to define the problem to solve This week, Tom Parson and Krishna Raha share tools and frameworks to identify and address challenges effectively, while Voltage Control highlights
Issue #568: Random mazes, train clock, and ReKill
Friday, November 22, 2024
View this email in your browser Issue #568 - November 22nd 2024 Weekly newsletter about Web Game Development. If you have anything you want to share with our community please let me know by replying to
Whats Next for AI: Interpreting Anthropic CEOs Vision
Friday, November 22, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 22, 2024? The HackerNoon
iOS Cocoa Treats
Friday, November 22, 2024
View in browser Hello, you're reading Infinum iOS Cocoa Treats, bringing you the latest iOS related news straight to your inbox every week. Using the SwiftUI ImageRenderer The SwiftUI ImageRenderer