AI Chatbots Fail 8th Grade Math Test

Tech Monday, July 14, 2025 by Zac

Popular AI chatbots like ChatGPT and Gemini were recently given a math test designed for 8th-grade students. Surprisingly, they all struggled with one particular question.

What are Chatbots?

Chatbots are computer programs that use artificial intelligence (AI) to understand and respond to questions and commands. They're trained on huge amounts of text data, allowing them to generate text, answer questions, and even have conversations that feel somewhat human-like. ChatGPT, created by OpenAI, was one of the first to become widely known. Now, many companies have their own AI models, including Google (Gemini), Anthropic (Claude), DeepSeek, and Perplexity.

The 8th Grade Math Test

A user on Reddit decided to test these chatbots by giving them a math test meant for 8th graders. The AI models tested were OpenAI's o3, Gemini 2.5 Pro, and Claude Sonnet 4; the Gemini version used was an older one. They had to answer 15 questions without any extra help or hints. The user also made sure the questions were new and hadn't been used to train the AI models before.

How Did They Do?

OpenAI's model and Gemini both got 14 out of 15 questions right. However, they both failed on the same question, question 12. Claude's model did a bit worse, answering only 12 questions correctly. The Reddit user noted that they didn't have access to Claude's most powerful model, which might have performed better.

The Problem Question

The tricky question involved a number line with points A, B, and C marked on it. The distance between points A and C was divided into 6 equal parts. The number line also showed the coordinates 56 and 83. The students (or in this case, the AI) had to decide if these two statements were true or false:

The coordinate of point C is an even number.
The coordinate of point B is a number less than 74.

Why Was It So Hard?

To solve the problem, you first need to figure out the length of each section on the number line. The distance between the coordinates 56 and 83 covers three sections. The total distance between 56 and 83 is 27 units (83 - 56 = 27). So, each section is 9 units long (27 / 3 = 9). Knowing this, you can find the coordinates of point C. The correct answers are:

The first statement is FALSE. Point C is at coordinate 101 (56 + 5*9 = 101), which is an odd number.
The second statement is TRUE. The mark two sections to the right of 56 is at coordinate 74 (56 + 2*9 = 74), and point B lies to the left of it, so its coordinate is less than 74.
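
The arithmetic can be checked with a short Python sketch. It uses only the numbers given in the article (56, 83, three sections between them) and assumes that point C lies five sections to the right of 56, since that is the offset that reproduces the stated coordinate 101. The function and variable names are made up for illustration. The article does not give an exact position for point B, so the sketch only computes the 74 mark that B sits to the left of.

```python
# Values taken from the article: two labelled coordinates, three sections apart.
LEFT_LABEL = 56
RIGHT_LABEL = 83
SECTIONS_BETWEEN_LABELS = 3


def section_length(left: int, right: int, sections: int) -> int:
    """Length of one equal section between two labelled coordinates."""
    distance = right - left
    if distance % sections != 0:
        raise ValueError("labels are not a whole number of sections apart")
    return distance // sections


def mark(start: int, step: int, n: int) -> int:
    """Coordinate of the mark n sections to the right of start."""
    return start + n * step


step = section_length(LEFT_LABEL, RIGHT_LABEL, SECTIONS_BETWEEN_LABELS)

# Assumption: C is five sections to the right of 56 (this matches the
# coordinate 101 stated in the article; the figure itself is not shown).
point_c = mark(LEFT_LABEL, step, 5)

# The mark two sections to the right of 56; point B lies left of this mark.
mark_74 = mark(LEFT_LABEL, step, 2)

statement_1 = point_c % 2 == 0  # "The coordinate of point C is an even number."

print(step, point_c, mark_74, statement_1)  # prints: 9 101 74 False
```

Any coordinate strictly to the left of the 74 mark is less than 74, which is why the second statement is true even without knowing exactly where B is.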

The Chatbots' Mistake

A screenshot showed that ChatGPT incorrectly assumed that point B was exactly at coordinate 74. Because of this, it wrongly concluded that the coordinate of point B was not less than 74, but equal to it. When the test was repeated with Gemini, it made the exact same mistake.

This shows that even though AI chatbots are very advanced, they can still struggle with certain types of problems, especially those involving visual information and spatial reasoning.

