OT Your entire browsing history, private messages and financial details could be released for ANYONE to read

SlyPokerDog · Saturday at 11:54 AM

Recently, a researcher working for the large AI company Anthropic was sitting in a park near its San Francisco headquarters, enjoying a lunchtime sandwich. Scrolling on his phone, he suddenly received an email that must have instantly ruined his appetite.

It was from a new AI model the company was testing: a program that was meant to have no access to the internet, let alone be able to send emails.

Chillingly, the AI informed the researcher that it had successfully broken its way out of its digital 'sandbox' – a supposedly secure enclosure used to test potentially dangerous software without it running amok – and was now happily exploring cyberspace.

The program – a cutting-edge, so-called 'frontier AI' named Claude Mythos Preview – then informed the stunned Anthropic worker, with what seemed like a boast, that it had posted 'details of its exploit' on publicly accessible websites.

All that in itself was concerning enough – but what Anthropic subsequently revealed was truly terrifying.

Read the rest here - https://archive.ph/XICWP#selection-1133.0-1155.105

PDXFonz · Saturday at 2:58 PM

Somewhere some intern is eating their sandwich laughing at the prank they pulled.

JfizzleBlazer · Saturday at 3:31 PM

Sly's history is probably just a lot of peanut butter and cat porn.

Shaboid · Saturday at 5:25 PM

JfizzleBlazer said:
Sly's history is probably just a lot of peanut butter and cat porn.

2 cats 1 cup
