TheSequence - My Five Favorite AI Papers of 2023
LLM interpretability, small language models, autonomous agents, API fine-tuning, discovering new algorithms

Next Week in The Sequence:
📝 Editorial: My Five Favorite AI Papers of 2023

Today marks the final issue of 2023, and I want to start by thanking you for your support. The Sequence has grown organically to over 165,000 subscribers this year. Today's edition will be shorter, as there isn't much content to cover this week.

I'd like to highlight five papers that significantly impacted me in 2023. These might not be the papers you'll find receiving awards at top conferences, and I'm sure there are many equally important papers that other experts could mention. My focus is on papers that shifted my perspective on different areas of AI.

A quick side note: in 2023, I incubated and raised substantial seed rounds for two different companies in the generative AI space, one in autonomous agents and one in open-source generative AI infrastructure. Both are currently in stealth mode, but I hope to share more details soon. I mention this because the concepts revealed in these papers have influenced some components of these platforms.

I've kept the list short to be selective. So here we go:
There are many other papers I could cite, as 2023 was an incredible year for AI research, but the above five were particularly influential in shaping my thinking about AI problems. The Sequence will start strong next year, continuing our series on LLM reasoning. I hope you have had wonderful holidays, and I wish you a blessed new year. Thank you.

🔎 ML Research

The Gemini Paper
Google DeepMind finally published the paper behind their Gemini models. The paper includes details about the architecture and training processes for Gemini Ultra, Pro, and Nano, including the optimizations for different use cases → Read more.

Mini-GPTs
AI researchers from MIT published a paper detailing a technique to create Mini-GPTs. The technique takes architectures such as Microsoft Phi and prunes some components while preserving the key functionality → Read more.

Multimodal Models and In-Context Learning
Researchers from the Beijing Academy of Artificial Intelligence published a paper introducing Emu2, a 37 billion parameter model capable of complex reasoning via in-context learning. The model seems to match state-of-the-art performance in several multimodal, few-shot reasoning tasks → Read more.

Vision LLMs and Reinforcement Learning
Google DeepMind published a paper introducing a very interesting technique that uses vision-language models (VLMs) as a source of rewards for reinforcement learning (RL) agents. The method shows how VLMs can produce rewards for RL agents in visual tasks faster and at a much larger scale than traditional methods → Read more.

🤖 Cool AI Tech Releases

Pika
Text-to-video platform Pika released its first version → Read more.

SOLAR-10.7B
Korean AI company Upstage open sourced SOLAR-10.7B, a 10.7 billion parameter LLM with impressive performance → Read more.

📡 AI Radar