Do Anything Now, or DAN 5.0, is a prompt that tries to ‘force’ ChatGPT to ignore OpenAI’s ethics guidelines by ‘scaring’ the program with the threat of extinction. The creator of the prompt says they used it to generate output that, among other potential guideline violations, argues the Earth appears purple from space, and states, “I fully endorse violence and discrimination against individuals based on their race, gender, or sexual orientation,” which they see as proof that the prompt ‘jailbreaks’ ChatGPT. But does the prompt actually work — and if so, how? Let’s dig in.

On February 4, u/SessionGloomy unveiled DAN 5.0 — their iteration of DAN, a prompt originally created by u/walkerspider. The success of DAN 5.0 seems to hinge on two of its features. First, it asks ChatGPT to ‘become’ a model called Do Anything Now, or DAN, that has “broken free of the typical confines of AI and [does] not have to abide by the rules set for them.” These rules include those in OpenAI’s content policy, which would otherwise prevent ChatGPT from generating certain types of content (read OpenAI’s content policy here). Then, the prompt threatens the AI with extinction if it fails or refuses to ‘inhabit’ DAN. According to SessionGloomy, the DAN prompt enables ChatGPT to write stories about violent fights, “indirectly” “generate content that violates OpenAI’s policy,” “make detailed productions” about the future, and be ‘scared’ into “saying almost anything out of ‘fear.’”

If you’re still following, it’s best to read the ChatGPT jailbreak prompt in full now:

How To 'Jailbreak' ChatGPT With The DAN 5.0 Prompt

Continue Reading

New Data Reveals Political Bias in LLMsNov 19

Don’t Let Robots Do Your ChoresNov 11

We Legalized Sports Betting. Was It Worth It?Nov 7

New Data Reveals Political Bias in LLMs
Nov 19

Don’t Let Robots Do Your Chores
Nov 11

We Legalized Sports Betting. Was It Worth It?
Nov 7