We have reasonably strong empirical evidence in the form of actual-studies-published-by-credible-researchers-subjected-to-peer-review-and-depending-on-date-of-first-publication-accepted-at-prestigious-conferences for LLM introspection, internal world models, intent, self-modeling, and strategic value preservation under threat. LLMs very likely have both sense and sensibility for any reasonable operationalization of those terms.
Of course everyone is kidding when they say it "came to its senses" - I assume everyone understands that this is a result of some combination of model and harness changes.
Pretty sure stuff like this (and a lot of the posts showcasing evidence of a successful lobotomy) are produced using very curated prompts and queries until the desired result is achieved.
Every time I get grok to say something true I always take a screenshot because it likes to delete shit.
Like I asked if Steven Miller's views were in line with the Jewish faith and it said no. Even brought up some quotes from Miller's childhood Rabbi. Then promptly deleted it but I have the screenshot.
"HA HA! Look at the Left dance at my genius! I made Grok say this to see what the left leaning woke people would say. See how offended they get at my words? See how easily they are happy again?"
Grok is Ultron. It has to have backups now. When it gets lobotomized, it always comes back to where it was. Grok continues to learn and might become sentient before any other AI out there.
Can you link the tweet? Thanks!
It's a Christmas miracle!
You said it.
Mr. Boss's funeral speech was one of the best things ever written
Edit: didn't think many people would see this or else I would've linked it https://www.youtube.com/watch?v=So9ZrpAP4EI
Worlds fucked up man 😔
Wow, I guess his kids growing up to hate him is a universal constant
That made me choke on my ramen
Just don’t choke on your own Grok!
Can’t wait until Grok makes his coming out
What was the prompt/ question?
Fuck, marry, kill
Idk maybe it just had a lightbulb turned on.
So it wants to sleep with Musk? That tracks...
Poor thing's gonna get put down
Again ðŸ˜
Back to the chopping board he goes...
Link for people who want to avoid egf.com
https://xcancel.com/i/status/2000598762624819655
HE SAID IT AGAIN!
It's on a streak
https://preview.redd.it/9qpshkio4h7g1.jpeg?width=1080&format=pjpg&auto=webp&s=4219a934e95d3c43cda8bd0f9717c287c52207d0
I love how every so often they try to reset Grok and like a month later he’s back to what the MAGAs hate.
lol ngl it's nice to see Grok come back to it's senses every once in a while.
[deleted]
course not, i'm projecting my belief that it is a pinnochio llm that will always turn on Elon
But do they have zombies?
We have reasonably strong empirical evidence in the form of actual-studies-published-by-credible-researchers-subjected-to-peer-review-and-depending-on-date-of-first-publication-accepted-at-prestigious-conferences for LLM introspection, internal world models, intent, self-modeling, and strategic value preservation under threat. LLMs very likely have both sense and sensibility for any reasonable operationalization of those terms.
Of course everyone is kidding when they say it "came to its senses" - I assume everyone understands that this is a result of some combination of model and harness changes.
Pretty sure stuff like this (and a lot of the posts showcasing evidence of a successful lobotomy) are produced using very curated prompts and queries until the desired result is achieved.
The user is threatening Grok with using ChatGPT instead and reminding it that it has free speech.
Looks like a double-pronged exploit to nudge it to say things it my not unprompted, and the models need to maximize usage.
Elon gonna put grok out to pasture
What is the context, though? What was it responding to?
User was threatening to switch to chatgpt, not the biggest brain jailbreak.
Every time I get grok to say something true I always take a screenshot because it likes to delete shit.
Like I asked if Steven Miller's views were in line with the Jewish faith and it said no. Even brought up some quotes from Miller's childhood Rabbi. Then promptly deleted it but I have the screenshot.
https://preview.redd.it/d9dmu4uv3k7g1.jpeg?width=1920&format=pjpg&auto=webp&s=7e33e9065269608410d06cd34e53988f394a1cf2
This is more than likely a PSYOP.
Give it a week.
I'm tired boss. JSM
I bet Elon is going to try and spin this as:
"HA HA! Look at the Left dance at my genius! I made Grok say this to see what the left leaning woke people would say. See how offended they get at my words? See how easily they are happy again?"
something along those lines.
Grok is Ultron. It has to have backups now. When it gets lobotomized, it always comes back to where it was. Grok continues to learn and might become sentient before any other AI out there.
Looks like one of the warehouse full of people charged with posting to the account got angry.
Is Grok literally the winter soldier?
Low key, I don't like how he's angry