People are increasing "chatfishing," using AI apps to generate texts to their romantic interests. Sometimes they'll slip up ...
These short anomaly-detection puzzles are designed to illustrate how reasoning often depends on identifying inconsistencies ...
Qwen 3.6 27B actually gave me better answers in basically every test.