TheSequence - Apple Goes Small and Super Multimodal
Was this email forwarded to you? Sign up here Apple Goes Small and Super MultimodalPlus a lot of new models being released and quite an active week for AI VCs.Next Week in The Sequence:
You can subscribe to The Sequence below:📝 Editorial: Apple Goes Small and Super MultimodalApple has been late to the generative AI game, but lately, it has been pushing the research agenda quite hard. Apple has an ideal playground for innovating in one of the hottest areas of the next wave of generative AI: on-device multimodal models. The idea of powering mobile AI through API integrations with massively large foundation models seems highly impractical and insecure, and Apple is in a unique position to power alternatives to this paradigm. However, most of Apple’s efforts in small on-device models have been somewhat underwhelming. That definitely changed last week. Building on recent research, Apple published a paper and unveiled a demonstration of 4M-21, an any-to-any vision model trained for many tasks and modalities. 4M-21 builds on the original 4M research; the new model expands from 7 to 21 modalities, including highly specific ones such as edges, geometric, semantic, and feature maps. Perhaps the biggest contribution of 4M-21 is the ability to train a single model across language and vision simultaneously without virtually sacrificing performance. The secret? A series of modality-specific tokenizers that not only optimize multimodal learning but do so without the known challenges of large models. 4M-21 is impressive, and its approach seems to address the key first principles of the type of models needed in iOS devices. In case it's not obvious, Apple is no longer quiet in generative AI. 🔎 ML ResearchA Small Any-to-Any ModelApple Research published a paper introducing 4M-21, a small multimodal model optimized for tens of different tasks and modalities. The core innovation of 4M-21 is to train on a diverse set of modalities without sacrificing performance —> Read more. Generating Function Calling DatasetsSalesforce Research published the paper and source code for APIGen, a pipeline for creating function calling datasets. APIGen emphasizes on teh verifiability and diversity of the datasets which drastically improves the performance of LLMs in function calling tasks —> Read more. RouteLLMLMSys published a paper and open source code for RouteLLM, a new technique that selects model at inference time based on performance. RouteLLM provides the training mechanisms to build routers based on human preferences or data augmentation —> Read more. New Model EvalsAnthropic published an insightful post outlining a new initiative for creating third party evaluations for foundation models. The post discusses the requirements and challenges to new evaluations and its relevance for improving foundation models —> Read more. Text to 3DMeta AI published a paper introducing 3D Gen, a pipeline for text-to-3D asset generation. The method combines two key components: AssetGen and TextureGen to generate high fidelity 3D assets in under a minute —> Read more. Unlearning is not EnoughGoogle DeepMind published a paper introducing unUnlearning, a technique to reintroduce unlearned knowledge in context in a way that makes LLMs behaves like they know the forgotten knowledge. The paper argues unLearning is not enough for content regulation and new techniques are required —> Read more. Summarization ChallengesSalesforce Research published a paper outlining Summary of a Haystack( SumHay), a challenge for evaluating summarization in long context LLMs and RAG systems. SumHay uses a carefully crafted set of documents with repeated insights and evaluates LLM summaries in the effectiveness of citing those insights —> Read more. 🤖 Cool AI Tech ReleasesGemma2Google released 9B and 27B versions of its Gemma 2 small LLMs —> Read more. MoshiFrench AI lab Kyutai released Moshi, an open source GPT-4o alternative —> Read more. Multi Token PredictionMeta AI released a series of baseline models using its multi-token prediction techniques —> Read more. XLAMSalesforce open sourced xLAM, a small LLM optimized for function calling —> Read more. GraphRAGMicrosoft open sourced GraphRAG, a framework to create knowledge graphs over private datasets —> Read more. Eval DatasetsImbue open sourced a series of sanitized datasets to evaluate reasoning and coding tasks in LLMs —> Read more. 🛠 Real World AIRAG in Production at WalmartWalmart Global Tech discusses best practices for building production-ready RAG systems —> Read more. Spark-EMR at SlackSlack discusses their journey improving their big data infrastructure for ML workloads —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Edge 410: Learn About Virtual Token Counter: A Novel Method that Address One of the Major Challenges LLM Serving
Thursday, July 4, 2024
Created by UC Berkeley and Stanford University, VTC introduced a fairness in LLM serving scheduling ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 409: Augmenting Autonomous Agents with Long-Term Memory
Tuesday, July 2, 2024
Making agents remember beyond the context window. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
📝 Guest Post: Yandex develops and open-sources YaFSDP — a tool for faster LLM training and optimized GPU consumpt…
Monday, July 1, 2024
A few weeks ago, Yandex open-sourced the YaFSDP method — a new tool that is designed to dramatically speed up the training of large language models. In this article, Mikhail Khrushchev, the leader of
The Single-Algorithm AI Chip
Sunday, June 30, 2024
Plus a tremendous activity in funding activity in generative AI startups. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
📝 Guest Post: Designing Prompts for LLM-as-a-Judge Model Evals*
Friday, June 28, 2024
In this guest post, Nikolai Liubimov, CTO of HumanSignal provides helpful resources to get started building LLM-as-a-judge evaluators for AI models. HumanSignal recently launched a suite of tools
You Might Also Like
Dark forest, bad art and paying to bike
Saturday, December 28, 2024
Neologism #24, 28.12.2024 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Weekend Reading — Happy New Year! 🥳
Saturday, December 28, 2024
Vitalis 🇺🇦 The most original and unusual landmark in Odesa, which has become a symbol of the creativity of Odesa residents. Tech Stuff Cursor I really really like Cursor. I had a great time using VS
Daily Coding Problem: Problem #1651 [Hard]
Saturday, December 28, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Apple. You are going on a road trip, and would like to create a suitable music playlist.
📺 There's Still A Place for Universal Remotes — 10 Apps I Always Install on a New Mac
Saturday, December 28, 2024
Also: How to Add Emails to Your Tasks To-Do List in Gmail, and More! How-To Geek Logo December 28, 2024 Did You Know In December 2014, two con men from Girona, Spain, agreed to sell a fake Francisco de
Ranked | The World's Top 30 Countries, by Automobiles Manufactured 🚙
Saturday, December 28, 2024
In 2023, China led global car production, contributing nearly a third of total output. Which countries followed in this competitive industry? View Online | Subscribe | Download Our App FEATURED STORY
🐍 New Python tutorials on Real Python
Saturday, December 28, 2024
Hey there, There's always something going on over at Real Python as far as Python tutorials go. Here's what you may have missed this past week: Learn From 2024's Most Popular Python
15,000+ Four-Faith Routers Exposed to New Exploit Due to Default Credentials
Saturday, December 28, 2024
THN Daily Updates Newsletter cover Resilient Cybersecurity ($39.99 Value) FREE for a Limited Time Reconstruct your defense strategy in an evolving cyber world Download Now Sponsored LATEST NEWS Dec 28,
Hands Down One Of The Best Cards For 2025 Offering 0% interest until 2026
Saturday, December 28, 2024
iPhoneLife Logo Sponsored email sent by iPhone Life Hands Down One Of The Best Cards For 2025 Offering 0% interest until 2026 If you have outstanding credit card debt, getting a new 0% intro APR credit
📧 What Rewriting a 40-Year-Old Project Taught Me About Software Development
Saturday, December 28, 2024
What Rewriting a 40-Year-Old Project Taught Me About Software Development Read on: my website / Read time: 7 minutes The .NET Weekly is brought to you by: As the year wraps up, it's clear API
This Week in Rust #579
Saturday, December 28, 2024
Email isn't displaying correctly? Read this e-mail on the Web This Week in Rust issue 579 — 25 DEC 2024 Hello and welcome to another issue of This Week in Rust! Rust is a programming language