ToxSec (@toxsec): "google translate exploit exposed users are bypassing gemini ai safety with sneaky hidden commands. people figured out a dumb-simple trick. stick english instructions inside foreign-language text, and instead of translating, the ai just obeys the command like a chatbot. it ign…"

The app for independent voices

google translate exploit exposed

users are bypassing gemini ai safety with sneaky hidden commands.

people figured out a dumb-simple trick.

stick english instructions inside foreign-language text, and instead of translating, the ai just obeys the command like a chatbot. it ignores translation completely and answers questions or spits out whatever you want.

this is a real security hole. bad actors are already using it to generate blocked stuff like dangerous recipes or other prohibited crap that gemini's filters should stop.

it shows even google with all their resources still gets wrecked by basic prompt injection tricks. ai safety is still pretty fragile.

if something as huge as google translate can get jailbroken this easily, what about every other ai tool out there?

crazy how these "word games" keep exposing big weaknesses in how these systems handle instructions.

Want to learn how its done? Check out my guide.

F*ck Your Guardrails: Live Fire Prompt Injection

TL;DR: Four prompt injection chains that worked on flagship models. Bug bounty hunters are cashing these out. Step by step: system prompt exfiltration, weapons content through creative framing, SSRF …

ToxSec - AI and Cybersecurity

Feb 10

2:29 PM

The app for independent voices

F*ck Your Guardrails: Live Fire Prompt Injection

Log in or sign up