This post is not specifically about LLMs, though?

That’s what people have been pointing out. The 60 hours of training should have been a dead giveaway.

I hope the neurons use a logistic activation function. If it’s a saturating linear one, the result will still be full of surprises.
