For those of you who didn’t read the paper, the argument they’re making is similar to Godel’s Incompleteness Theorem: no matter how you build your LLM, there will be a significant number of prompts that make that LLM hallucinate. If the proof holds up then hallucinations aren’t a limitation of the training data or the structure of your particular model, they’re a limitation of the very concept of an LLM. That doesn’t make LLMs useless, but it does mean you shouldn’t ever use one as a source of truth.
Mostly Cities Skylines 2. The performance is not great, but it’s passable with the settings turned down and the actual city building is really good. Right now I’m working on a big expansion to my city further down the highway, just got the water/power lines run between them so it’s one big happy grid exporting the extra to other cities. Already looking forward to the things I’ll do differently in my second city!
Also playing some Super Mario Wonder on the side, which is fantastic. Great mix of easy fun levels and hard-as-nails secret special levels. Very fun!
It’s there a major anniversary for The 7th Guest coming up or something? Between this article resurfacing, the VR Remake, and the industrial rock cover album on vinyl I’m seeing a lot of T7G content lately. And I’m here for it!
Prior to the API fiasco, Reddit Inc had demonstrated a pattern of promising changes to the mods which they failed to deliver timely if at all. They’ve acknowledged this pattern, promised to do better, then failed to deliver time and again. That part isn’t new.
Then the API changes were announced and the Reddit community gave Reddit Inc the loudest and most decisive rebuke they ever have. That was the feedback conversation. And Reddit Inc went forward with their plan unchanged. No concessions were made. No concerns were addressed or alleviated. Reddit Inc was informed of what this decision would break and they went ahead and broke it anyway.
As a former mod, there is nothing left to discuss. There is no reason to believe Reddit Inc will act on anything that doesn’t agree with what they’ve already decided to do. I’m not going back to that kind of abusive relationship. They had their chance to listen to feedback and made it clear that they won’t.
The 7th Guest and The 11th Hour remain top-tier for me. The music was half the experience, turn it up and just enjoy that creepy vibe while solving puzzles. And they’re all on The Fat Man’s Bandcamp now!
Also, there’s an industrial rock cover album on vinyl currently looking for funding on Kickstarter. I’m super excited!
Which is exactly what the paper recommends! As long as you have something that isn’t an LLM in the pipeline to vet the output and you’re aware is the tech’s limitations, they can be useful tools. But some of those limitations might be a more solid barrier than some sales departments would like us to believe.