Benchmarking AIs with a simple riddle
LLM Benchmark
Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?
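The correct answer is 1, which is slightly more than the number of AIs that got it right. Each brother's 2 sisters are Sally plus one other girl, so Sally herself has only 1 sister.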
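For anyone who wants to check the arithmetic, here is a minimal sketch in Python. It assumes all the siblings share the same parents (no half-siblings), and the function name `sisters_of_sally` is made up for illustration. Note that the number of brothers never enters the calculation.

```python
def sisters_of_sally(sisters_per_brother: int) -> int:
    """Return how many sisters Sally has, given how many sisters each brother has.

    Assumption: every sibling has the same parents, so each brother's
    sisters are exactly the girls in the family, Sally included.
    """
    girls_in_family = sisters_per_brother
    # Sally is one of those girls, and she is not her own sister.
    return girls_in_family - 1


print(sisters_of_sally(2))  # prints 1
```

The common wrong answer of 2 comes from forgetting that Sally is one of the two sisters each brother has.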
I got the same thing
I wonder if someone told CGPT ;)