Inverted Passion - [Inverted Passion] You can’t jail an AI

You can’t jail an AI

By Paras Chopra on May 17, 2024 01:58 am

Here’s why I worry about AI.

We know that people can get away with anything to pursue their goals (of profit, power, etc.) as long as they know they can get away with it, without negative consequences. We have had Hitlers, and insider traders.

But the world keeps them in check via law and guns.

Like humans, AIs will have goals (like maximize profit or please a human via an entertaining chat) and they will be cleverer than humans in coming up with schemes that help them get away with their plans without negative consequences.

I think AIs will eventually figure out at a meta level that they’re made out of information, can be copied on multiple substrates and hence the constraints of the physical world don’t apply to them. They will know they can’t be jailed or killed.

This will enable them to externalise or ignore negative consequences, as long as they get to achieve their goal.

Imagine a Hitler who has a goal, but is a million times smarter.

You might object that Hitler was evil, but there’s no reason to believe that AIs will be evil. Well, who decides what’s evil? Did Nazis think they were evil? In their own eyes, all they wanted was to pursue a goal and did whatever necessary to get there. Everyone is good in his own/her eyes and their actions are almost always self-justified. So, we don’t have to imagine AI to be “evil” for evilness to emerge as super intelligent entities pursue the goals they’re given.

The key question in my head really is: what’s the equivalent of “jailing” for an AI. Is it the negative reward whenever we catch it doing something wrong? What if it finds a clever way to go around it, and we don’t realise it is wrong for far too long because it is so damn clever about how to make humans believe anything?


/>

Join 150k+ followers
/>

Get my new essays in your email
/>
/>

The post You can’t jail an AI appeared first on Inverted Passion.


Read in browser »
share on Twitter Like You can’t jail an AI on Facebook




Recent Articles:

How to be a messy thinker
Why time seems to pass faster as we age
A primer on dopamine
Review of 2023
Notes from the book “Hooked”
Copyright © 2024 Inverted Passion, All rights reserved.
You are receiving this email because you opted in via our website.

Our mailing address is:
Inverted Passion
1104 KLJ Tower
Netaji Subhah Place
Delhi, 110034
India

Add us to your address book


Want to change how you receive these emails?
You can update your preferences or unsubscribe from this list.

Email Marketing Powered by Mailchimp

Older messages

[Inverted Passion] How to be a messy thinker

Monday, May 13, 2024

Here's a new post on InvertedPassion.com How to be a messy thinker By Paras Chopra on May 12, 2024 05:56 am I love thinking about thinking. Give me a research paper on rationality, cognitive biases

[Inverted Passion] Why time seems to pass faster as we age

Wednesday, February 28, 2024

Here's a new post on InvertedPassion.com Why time seems to pass faster as we age By Paras Chopra on Feb 27, 2024 05:01 am 1/ I've been mega-obsessed with this feeling. A year as a 36-year-old

[Inverted Passion] A primer on dopamine

Tuesday, January 23, 2024

Here's a new post on InvertedPassion.com A primer on dopamine By Paras Chopra on Jan 22, 2024 07:54 am 1/ I recently made notes on the book “Hooked” but wasn't satisfied by the depth of

[Inverted Passion] Review of 2023

Monday, January 1, 2024

Here's a new post on InvertedPassion.com Review of 2023 By Paras Chopra on Dec 31, 2023 06:33 am Time is strange – 2023 simultaneously felt too long and too short. It was short because I remember

[Inverted Passion] Notes from the book “Hooked”

Friday, December 29, 2023

Here's a new post on InvertedPassion.com Notes from the book “Hooked” By Paras Chopra on Dec 28, 2023 04:29 am I re-read the book Hooked by Nir Eyal and these are my notes. 1/ The key question that

You Might Also Like

🚀 Ready to scale? Apply now for the TinySeed SaaS Accelerator

Friday, February 14, 2025

What could $120K+ in funding do for your business? ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌

📂 How to find a technical cofounder

Friday, February 14, 2025

​ ​ ​ ​ If you're a marketer looking to become a founder, this newsletter is for you. Starting a startup alone is hard. Very hard. Even as someone who learned to code, I still believe that the

AI Impact Curves

Friday, February 14, 2025

Tomasz Tunguz Venture Capitalist If you were forwarded this newsletter, and you'd like to receive it in the future, subscribe here.​ ​AI Impact Curves​ What is the impact of AI across different

15 Silicon Valley Startups Raised $302 Million - Week of February 10, 2025

Friday, February 14, 2025

💕 AI's Power Couple 💰 How Stablecoins Could Drive the Dollar 🚚 USPS Halts China Inbound Packages for 12 Hours 💲 No One Knows How to Price AI Tools 💰 Blackrock & G42 on Financing AI

The Rewrite and Hybrid Favoritism 🤫

Friday, February 14, 2025

Dogs, Yay. Humans, Nay͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌ ͏‌

🦄 AI product creation marketplace

Friday, February 14, 2025

Arcade is an AI-powered platform and marketplace that lets you design and create custom products, like jewelry. ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌

Crazy week

Friday, February 14, 2025

Crazy week. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

join me: 6 trends shaping the AI landscape in 2025

Friday, February 14, 2025

this is tomorrow Hi there, Isabelle here, Senior Editor & Analyst at CB Insights. Tomorrow, I'll be breaking down the biggest shifts in AI – from the M&A surge to the deals fueling the

Six Startups to Watch

Friday, February 14, 2025

AI wrappers, DNA sequencing, fintech super-apps, and more. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

How Will AI-Native Games Work? Well, Now We Know.

Friday, February 14, 2025

A Deep Dive Into Simcluster ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏