The Reasoning Race: Can Small Models Reason?

And Some Major Changes in The Sequence you should read about.

Jan 5

A note to all subscribers:

Welcome to The Sequence 2025! I’ve been eagerly waiting for the end of the year to propose some changes that I believe will tremendously improve the experience for you, the readers of this newsletter.

We started The Sequence a few years ago as a hobby project at a time when AI had yet to reach the mainstream levels of popularity it enjoys today. Little did I suspect that we would approach 200,000 subscribers, including members from some of the world’s top AI organizations.

What has always set The Sequence apart from other newsletters is its focus on deep technical content and original ideas rather than chasing news or hype. Over the past few weeks, I analyzed current readership patterns and came away with some important insights:

The Sequence should feature content targeted at both AI scientists and engineers, covering both research and implementation topics.
The educational series are highly popular, but readers sometimes skip the engineering sections because they’re too long or unrelated to the main topic.
Our original and controversial topics, explored in longer pieces, consistently attract the highest readership, which I love.
While we’re bombarded with sponsorship requests, I’ve decided to slow things down and work out a non-invasive strategy that aligns better with our audience.
Interviews have received very positive feedback.
Some of the branding, like The Edge and The Scope, feels outdated.

With this in mind, in 2025, we’re nearly doubling our coverage with the following weekly editions:

The Sequence Knowledge: Continuing with educational topics and related research. We’re kicking off an exciting series on RAG and have others lined up on evaluations, decentralized AI, code generation, and more.
The Sequence Engineering: A standalone edition dedicated to engineering topics such as frameworks, platforms, and case studies. I’ve started three AI companies in the last 18 months, so I have plenty of opinions about engineering topics.
The Sequence Chat: Our interview series featuring researchers and practitioners in the AI space.
The Sequence Research: Covering current research papers.
The Sequence Insights: Weekly essays on deep technical or philosophical topics related to AI.
The Sequence Radar: Our Sunday edition covering news, startups, and other relevant topics.

That’s six editions! Each one will be relatively short and easy to follow. We’re starting with this new structure next week!

We’re also discussing potential price changes for new subscribers (current subscribers won’t be affected), so I encourage you to subscribe in the next few days if you haven’t already.

To the companies reaching out with sponsorship opportunities: thank you for your patience. We’ll have something to discuss soon.

I hope you love these changes. If nothing else, they should make The Sequence even more enjoyable while doubling down on what we do best.

Thanks for your continued support.

Jesus Rodriguez.

Now, on to today’s edition! As mentioned before, we are skipping our typical Sunday edition given the limited market activity during the holidays. Instead, we are discussing another controversial AI topic.

Is Reasoning Exclusive to Massive Models or Do Small Models Have a Chance?

The rapid advancement of artificial intelligence has led to the emergence of large language models (LLMs) like OpenAI's o1 and o3, which exhibit remarkable reasoning capabilities. However, the prevailing notion is that such reasoning is largely confined to models with very large parameter counts, often in the hundreds of billions. This essay explores whether small language models (SLMs) can develop reasoning abilities comparable to those of their larger counterparts. We will discuss the nature of reasoning in LLMs, examine the techniques that enhance reasoning in SLMs, and challenge the assumption that reasoning is exclusive to larger models.

Understanding Reasoning in Large Language Models...
