TheSequence - My Five Favorite AI Papers of 2023
Was this email forwarded to you? Sign up here My Five Favorite AI Papers of 2023LLM interpretability, small language models, autonomous agents, API fine-tuning, discovering new algorithmsNext Week in The Sequence:
You can subscribe below!📝 Editorial: My Five Favorite AI Papers of 2023Today marks the final issue of 2023, and I want to start by expressing my gratitude for your support. The Sequence has grown organically to over 165,000 subscribers this year. Thank you all for your continued support. Today's edition will be shorter, as there isn't much content to cover this week. I'd like to highlight five papers that significantly impacted me in 2023. These might not be the papers you'll find receiving awards at top conferences, and I'm sure there are many equally important papers that other experts could mention. My focus is on papers that shifted my perspective on different areas of AI. A quick side note: in 2023, I incubated and raised substantial seed rounds for two different companies in the generative AI space—one in autonomous agents and one in open-source generative AI infrastructure. Both are currently in stealth mode, but I hope to share more details soon. I mention this because the concepts revealed in these papers have influenced some components of these platforms. I've kept the list short to be selective. So here we go:
There are many other papers I could cite, as 2023 was an incredible year for AI research, but the above five were particularly influential in shaping my thinking about AI problems. The Sequence will start strong next year, continuing our series on LLM reasoning. I hope you have had wonderful holidays, and I wish you a blessed new year. Thank you. 🔎 ML ResearchThe Gemini PaperGoogle DeepMind finally published the paper behind their Gemini models. The paper includes detail about the architecture and training processes for Gemini Ultra, Pro and Nano including the optimizaton for different use cases —> Read more. Mini-GPTsAI researchers from MIT published a paper detailing a technique to create Mini-GPTs using. The technique uses architectures such as Microsoft Phi and prunes some components while preserving the key functionality —> Read more. Multimodal Models and In-Context LearningResearchers from the Beijing Academy of Artificial Intelligence pubished a paper introduing Emu2, a 37 billion parameter model capable of complex reasoning via in-context learning. The model seems to match state of the art performance in several multimodal, few-shot, reasoning tasks —> Read more. Vision LLMs and Reinforcement LearningGoogle DeepMind published a paper introducing a very interesting technique that uses vision-language models(VLMs) as a source of rewards for reinforcement learning(RL) agents. The method shows how VLMs can produce rewards for RL agents in visual tasks faster and at a much larger scale than traditional methods —> Read more. 🤖 Cool AI Tech ReleasesPikaText-to-Video platform Pika released its firt version —> Read more. SOLAR-10.7BKorean AI company Upstage open sourced SOLAR-10.7B, a 10.7 billion parameter LLM with impressive performance —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Inside Orca 2: Microsoft's Small Language Model that Outperforms Models 10x Larger in Reasoning Capabilities
Thursday, December 28, 2023
The model innovating in the training procedures to improve reasoning abilities in small language models.
Edge 355: A Taxonomy to Understand LLM Reasoning Methods
Tuesday, December 26, 2023
Not all LLM reasoning methods are created equal. Here are the main categories to understand the different types of LLM reasoning techniques.
Apple GPT is Coming!
Sunday, December 24, 2023
A new research breakthrough outlines the path to run LLMs in IPhones and IPads.
Inside Mixtral 8x7B: One of the Most Exciting Open Source LLM Ever Releases of this Year
Thursday, December 21, 2023
The model follows Mistral 7b with an innovative mixture-of-experts architecture that deviates a bit from monolthical transformer models.
Edge 353: A New Series About Reasoning in Foundation Models
Tuesday, December 19, 2023
We dive into the most important research and technology frameworks in the LLM reasoning space.
You Might Also Like
Daily Coding Problem: Problem #1617 [Easy]
Saturday, November 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Microsoft. You are given an string representing the initial conditions of some dominoes.
Ranked | The Tallest and Shortest Countries, by Average Height 📏
Saturday, November 23, 2024
These two maps compare the world's tallest countries, and the world's shortest countries, by average height. View Online | Subscribe | Download Our App TIME IS RUNNING OUT There's just 3
⚙️ Your own Personal AI Agent, for Everything
Saturday, November 23, 2024
November 23, 2024 | Read Online Subscribe | Advertise Good Morning. Welcome to this special edition of The Deep View, brought to you in collaboration with Convergence. Imagine if you had a digital
Educational Byte: Are Privacy Coins Like Monero and Zcash Legal?
Saturday, November 23, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 23, 2024? The HackerNoon
🐍 New Python tutorials on Real Python
Saturday, November 23, 2024
Hey there, There's always something going on over at Real Python as far as Python tutorials go. Here's what you may have missed this past week: Black Friday Giveaway @ Real Python This Black
Re: Hackers may have stolen everyone's SSN!
Saturday, November 23, 2024
I wanted to make sure you saw Incogni's Black Friday deal, which is exclusively available for iPhone Life readers. Use coupon code IPHONELIFE to save 58%. Here's why we recommend Incogni for
North Korean Hackers Steal $10M with AI-Driven Scams and Malware on LinkedIn
Saturday, November 23, 2024
THN Daily Updates Newsletter cover Generative AI For Dummies ($18.00 Value) FREE for a Limited Time Generate a personal assistant with generative AI Download Now Sponsored LATEST NEWS Nov 23, 2024
📧 Building Async APIs in ASP.NET Core - The Right Way
Saturday, November 23, 2024
Building Async APIs in ASP .NET Core - The Right Way Read on: my website / Read time: 5 minutes The .NET Weekly is brought to you by: Even the smartest AI in the world won't save you from a
WebAIM November 2024 Newsletter
Friday, November 22, 2024
WebAIM November 2024 Newsletter Read this newsletter online at https://webaim.org/newsletter/2024/november Features Using Severity Ratings to Prioritize Web Accessibility Remediation When it comes to
➡️ Why Your Phone Doesn't Want You to Sideload Apps — Setting the Default Gateway in Linux
Friday, November 22, 2024
Also: Hey Apple, It's Time to Upgrade the Macs Storage, and More! How-To Geek Logo November 22, 2024 Did You Know Fantasy author JRR Tolkien is credited with inventing the main concept of orcs and