TheSequence - Apple GPT is Coming!
Was this email forwarded to you? Sign up here Next Week in The Sequence:
You can subscribe below!📝 Editorial: Apple GPT is ComingWhen we think about tech incumbents that could be severely disrupted by generative AI, Apple often tops the list. While Microsoft, Amazon, NVIDIA, Google, and even Meta have unveiled clear playbooks for their generative AI strategies, the Cupertino giant seems to have dangerously fallen behind in this space. That might soon change… In a somewhat surprising paper titled ‘LLM in a Flash: Efficient Large Language Model Inference with Limited Memory,’ Apple unveiled a new technique to run LLMs on devices with limited DRAM capacity. The cornerstone of this technique is the use of flash storage in mobile devices to store model parameters, loading them on-demand into DRAM. Apple’s method is hyper-optimized to minimize the volume of data transmitted from flash storage, while also transmitting the data in small, continuous chunks. The result allows for running models twice as large as the available DRAM, while also showing a 4.5x increase in inference speed on CPUs and 20-25x on GPUs, respectively. Quite impressive! ‘LLM in a Flash’ outlines a clear path for running sophisticated LLM models on iPhones and iPads, which seems like the natural vehicle for Apple to enter the generative AI space. Maybe we are about to see Apple GPT in the next iOS release after all. 🔎 ML ResearchLLM in a FlashApple Research published a paper outlining a technique for LLM inference with limited memory. The method involves storing the parameters in a flash memory and bringing them on demand to DRAM —> Read more. VideoPoetGoogle Research published a paper detailing VideoPoet, a zero-shot video generation LLM. The model supports a number of video generation tasks such as text-to-video, image-to-video, video stylization, video inpainting and outpainting, and video-to-audio —> Read more. InsightPilotMicrosoft Research published a paper discussing InsightPilot, an LLM-based system for data exploration. The framework takes a dataset as input and triggers a series of LLM-based analytical actions —> Read more. Multi-Step Reasoning AgentGoogle DeepMind published a paper outlining a ReAct-style LLM agent capable of multi-step reasoning. The agent uses reinforcement learning with AI feedback for regularly improvement and self-distillation —> Read more. 🤖 Cool AI Tech ReleasesMidjourney v6A new version of Midjourney is available with a lot of exciting capabilities —> Read more. Stable Video DiffusionStability AI made Stable Video Diffusion available via its developer platform API —> Read more. Titan ModelsAmazon announced the availability of two Titan models in its Bedrock platform —> Read more. 🛠 Real World MLAutoML at LinkedInLinkedIn shares some details about their AutoML architecture used for content abuse detection —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Inside Mixtral 8x7B: One of the Most Exciting Open Source LLM Ever Releases of this Year
Thursday, December 21, 2023
The model follows Mistral 7b with an innovative mixture-of-experts architecture that deviates a bit from monolthical transformer models.
Edge 353: A New Series About Reasoning in Foundation Models
Tuesday, December 19, 2023
We dive into the most important research and technology frameworks in the LLM reasoning space.
Four Releases from Google DeepMind in a Single Week!
Sunday, December 17, 2023
An impressive week by Google DeepMind plus a summary of the top research paper, tech releases and news in the AI space.
The Sequence Chat: Hugging Face's Lewis Tunstall on ZEPHYR , RLHF and LLM Innovation
Friday, December 15, 2023
One of the creators of ZEPHYR discusses ideas and lessons learned building LLMs at scale.
Edge 352: Inside the Embeddings Architecture Powering Job Recommendations at LinkedIn
Friday, December 15, 2023
Some insights about one of the largest embedding architectures ever built.
You Might Also Like
Daily Coding Problem: Problem #1617 [Easy]
Saturday, November 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Microsoft. You are given an string representing the initial conditions of some dominoes.
Ranked | The Tallest and Shortest Countries, by Average Height 📏
Saturday, November 23, 2024
These two maps compare the world's tallest countries, and the world's shortest countries, by average height. View Online | Subscribe | Download Our App TIME IS RUNNING OUT There's just 3
⚙️ Your own Personal AI Agent, for Everything
Saturday, November 23, 2024
November 23, 2024 | Read Online Subscribe | Advertise Good Morning. Welcome to this special edition of The Deep View, brought to you in collaboration with Convergence. Imagine if you had a digital
Educational Byte: Are Privacy Coins Like Monero and Zcash Legal?
Saturday, November 23, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 23, 2024? The HackerNoon
🐍 New Python tutorials on Real Python
Saturday, November 23, 2024
Hey there, There's always something going on over at Real Python as far as Python tutorials go. Here's what you may have missed this past week: Black Friday Giveaway @ Real Python This Black
Re: Hackers may have stolen everyone's SSN!
Saturday, November 23, 2024
I wanted to make sure you saw Incogni's Black Friday deal, which is exclusively available for iPhone Life readers. Use coupon code IPHONELIFE to save 58%. Here's why we recommend Incogni for
North Korean Hackers Steal $10M with AI-Driven Scams and Malware on LinkedIn
Saturday, November 23, 2024
THN Daily Updates Newsletter cover Generative AI For Dummies ($18.00 Value) FREE for a Limited Time Generate a personal assistant with generative AI Download Now Sponsored LATEST NEWS Nov 23, 2024
📧 Building Async APIs in ASP.NET Core - The Right Way
Saturday, November 23, 2024
Building Async APIs in ASP .NET Core - The Right Way Read on: my website / Read time: 5 minutes The .NET Weekly is brought to you by: Even the smartest AI in the world won't save you from a
WebAIM November 2024 Newsletter
Friday, November 22, 2024
WebAIM November 2024 Newsletter Read this newsletter online at https://webaim.org/newsletter/2024/november Features Using Severity Ratings to Prioritize Web Accessibility Remediation When it comes to
➡️ Why Your Phone Doesn't Want You to Sideload Apps — Setting the Default Gateway in Linux
Friday, November 22, 2024
Also: Hey Apple, It's Time to Upgrade the Macs Storage, and More! How-To Geek Logo November 22, 2024 Did You Know Fantasy author JRR Tolkien is credited with inventing the main concept of orcs and