“9.11 and Watch When Women Play Golf Online9.9, which one is bigger?” Questions as simple as this confuse large language models including OpenAI’s GPT-4o, Moonshot-created Kimi, and ByteDance’s Doubao, according to a post by local media Yicai. Chatbots from China’s Baidu and Tencent generate the correct answer despite using different methods, the former comparing fractional parts after concluding the integer parts are the same and the latter, Tencent’s Hunyuan, concluding that 9.9 is the bigger number by computing that 9.11 minus 9.9 is negative. ChatGPT and Kimi, which both gave a wrong answer to the first prompt, were correct after users clarified: “in terms of numerical value.” AI-powered chatbots are fed by internet data and trained to chat with humans in a natural way so that they can perform text-based knowledge-based tasks. [Yicai, in Chinese]
Related Articles
2025-06-26 14:35
1893 views
NYT Connections Sports Edition hints and answers for May 18: Tips to solve Connections #237
Connections: Sports Editionis a new version of the popular New York Times word game that seeks to te
Read More
2025-06-26 13:31
890 views
They Think They Know You, Lionel Messi by Rowan Ricardo Phillips
They Think They Know You, Lionel MessiBy Rowan Ricardo PhillipsJanuary 3, 2020Best of 2019We’re away
Read More
2025-06-26 12:18
750 views
Turtle, Turtle by Jill Talbot
Turtle, TurtleBy Jill TalbotJanuary 10, 2020The Last YearJill Talbot’s column, The Last Year, traces
Read More