Edge 342: Who's Happy Potter? Inside One of the Most Fascinating Papers Published This Year

Microsoft Research’s details a fine-tuning method for unlearning concepts in LLMs

Nov 9

READ IN APP

Create a cinematic scene featuring an advanced humanoid robot sitting in a dimly lit, cluttered attic, surrounded by dusty books and various magical paraphernalia. The robot's face displays a puzzled expression as it holds an open book with the title 'Wizardry & Witchcraft' barely visible. Scattered on the floor are a broomstick, a wizard's hat, and a pair of round glasses. The robot's finger is on a page as if it's trying to comprehend the text, with a holographic projection above its head showing a broken link symbol and question marks, symbolizing the forgetting of information. The atmosphere should convey a sense of confusion and forgetfulness. — Created Using DALL-E

Large language models(LLMs) are regularly trained in vast amounts of unlabeled data, which often leads to acquiring knowledge of incredibly diverse subjects. The datasets used in the pretraining of LLMs often including copyrighted material, triggering both legal and ethical concerns for developers, users, and original content creators. Quite often, specific knowledge from LLMs is required to be removed in order to adapt it to a specific domain. While the learning in LLMs is certainly impressive, the unlearning of specific concepts remains a very nascent area of exploration. While fine-tuning methods are certainly effective for incorporating new concepts, can they be used to unlearn specific knowledge?

In one of the most fascinating papers of this year, Microsoft Research explores an unlearning technique for LLMs. The challenge was nothing less than making Llama-7B to forget any knowledge of Harry Potter.

The Unlearning Challenge in LLMs...

Subscribe to TheSequence to read the rest.

Become a paying subscriber of TheSequence to get access to this post and other subscriber-only content.

A subscription gets you:

	Full access to TheSequence Edge – what's new in AI + the most relevant ML concepts, research papers, tech solutions
	Full archive
	Comments and discussions

Like

Comment

Restack

Edge 342: Who's Happy Potter? Inside One of the Most Fascinating Papers Published This Year

Edge 342: Who's Happy Potter? Inside One of the Most Fascinating Papers Published This Year

Microsoft Research’s details a fine-tuning method for unlearning concepts in LLMs

The Unlearning Challenge in LLMs...

Subscribe to TheSequence to read the rest.

A subscription gets you:

Older messages

The Sequence Chat: Nathan Benaich, Air Street Capital About Investing in Generative AI

Edge 341: What is Prompt-Tuning?

📝 Guest Post: Introduction to DiskANN and the Vamana Algorithm*

DeepMind's AlphaFold-Latest is Pushing the Boundaries of Scientific Exploration

📣 ML Engineering Event: Join HelloFresh, Remitly, Riot Games, Uber & more at apply(ops)

You Might Also Like

Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator

Defining Your Paranoia Level: Navigating Change Without the Overkill

5 ways AI can help with taxes 🪄

Recurring Automations + Secret Updates

The First Provable AI-Proof Game: Introducing Butterfly Wings 4

GCP Newsletter #437

Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰

The Great Social Media Diaspora & Tapestry is here

Daily Coding Problem: Problem #1689 [Medium]

📧 Stop Conflating CQRS and MediatR