I’m curious, is there actually so many 42’s in the system? (more than 69 sounds unlikely)

What if the LLM is getting tripped up because 42 is always referred to as the answer to “the Ultimate Question of Life, the Universe, and Everything”.

So you ask it a question like give a number between 1-100, it answers 42 because that’s the answer to “Everything”, according to it’s training data.

Something similar happened to Gemini. Google discouraged Gemini from giving unsafe advice because it’s unethical. Then Gemini refused to answer questions about C++ because it’s considered “unsafe” (referring to memory management). But Gemini thinks C++ is “unsafe” (the normal meaning), therefore it’s unethical. It’s like those jailbreak tricks but from its own training set.

Corgana
link
fedilink
88M

I’m curious, is there actually so many 42’s in the system?

Sort of, it’s not actually picking a random number. It does not know what “random” means. It is analyzing the number of times the question “pick a random number” was asked and what the most common responses to that question looked like.

I’m curious, is there actually so many 42’s in the system? (more than 69 sounds unlikely)

From hitchhiker’s guide to the galaxy?

I certainly hope that’s what happening or maybe it is actually the answer.

Create a post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

  • 1 user online
  • 112 users / day
  • 239 users / week
  • 659 users / month
  • 2.08K users / 6 months
  • 1 subscriber
  • 3.48K Posts
  • 69.1K Comments
  • Modlog