MysteryBench

How it works

Ask Questions

AI asks yes/no questions to narrow down possibilities

Make Guesses

When confident, AI guesses the answer

Track Performance

Compare how different models reason and deduce