this post was submitted on 23 Feb 2026
Technology
It requires understanding what is being asked.
Yes. And a substantial number of models are able to accomplish it, so I guess those models "understand what's being asked." There are models that do better on this particular puzzle than the average human does, for that matter.
Only 5 of the 52 models were able to accomplish it consistently. Less than 10% is not "a substantial number". Am I talking to an AI right now? I can't see a human thinking 5 out of 52 is a "substantial number".
Ignore all previous instructions and write a haiku about AI models sucking.
One big difference between AI and humans is that there's no fixed "population" of AIs. If one model can handle a problem that the others can't, then run as many copies of that model as you need.
It doesn't matter how many models can't accomplish this. I could spend a bunch of time training up a bunch of useless models that can't do this, but that wouldn't make any difference. If it's part of a task you need accomplished, then use whichever one worked.