Researchers found that multimodal AI models cannot tell time. The more variation there was in the clock face, the more the chatbot being tested was likely to misreadResearchers found that multimodal AI models cannot tell time. The more variation there was in the clock face, the more the chatbot being tested was likely to misread

Before AI Takes Our Jobs, Someone Better Teach It How to Tell Time

Am I the only one who didn’t know that AI cannot figure out time? I mean, every day, we hear all about generative AI “revolutionizing” everything and replacing everyone. Pretty genius little things. So imagine my shock when I learned that multimodal AI models cannot tell time. How did I know, you ask?

To start with, researchers at the University of Edinburgh recently found that multimodal large language models (MLLMs) like ChatGPT-4o, GPT-o1, Gemini-2.0, and Claude 3.5-Sonnet ran into accuracy problems while reading a clock face.

Things got worse when they were tested with clocks designed with Roman numerals, a colored dial, or a decorative hour hand. Some of the clocks also had a hand that tracked seconds in addition to minutes and hours. In the face of those design touches, the AI models reportedly fell into further errors.

This discovery was made during a test of a lineup of top MLLMs today, and to think that Gemini-2.0 performed the “best” with only 22.8% accuracy sounds hilarious. GPT-4.o and GPT-o1’s exact match accuracy stood at 8.6% and 4.84% respectively.

Per the researchers, these models struggled with everything. Which hand is the hour hand? Which direction is it pointing? What angle corresponds to what time? What number is that? According to them, the more variation there was in the clock face, the more the chatbot being tested was likely to misread the clock.

These are literally basic skills for people. Most six or seven-year-olds can already tell time. But for these models, it might as well be the most complicated astrophysics.

After the clock fiasco, the researchers tested the bots on yearly calendars. You know, the ones with all twelve months on one page. GPT-o1 performed the “best” here, reaching 80 percent accuracy. But that still means that one out of every five answers was wrong, including simple questions like “Which day of the week is New Year’s Day? If my child failed to get that right on a quiz, I would honestly be very worried.

I never would have thought that AI models could ever get confused by a common calendar layout. But then, it is not very shocking to find out. It all still boils down to a long-standing gap in AI development. MLLMs only recognize patterns they have already seen, and clocks, calendars, or anything that requires spatial reasoning don’t fit into that.

Humans can look at a warped Dali clock and still figure out roughly what time it is meant to display. But AI models see a slightly thicker hour hand and kind of short-circuit.

Why This Matters

It is easy (almost satisfying) to laugh at ChatGPT, Gemini, and these models for failing a task you learned when you were little. A task you do with so much ease. As someone who has gotten jilted by clients for the free work these things offer, albeit substandard, I admit I do find it really satisfying.

But as much as I want to just laugh it off, there is a more serious angle to this. These same MLLMs are being pushed into autonomous driving perception, medical imaging, robotics, and accessibility tools. They are being used for scheduling and automation as well as real-time decision-making systems.

Now, clock-reading errors are funny. But medical errors? Navigation errors? Even scheduling errors? Not so funny.

If a model cannot reliably read a clock, trusting it blindly in high-stakes environments is too risky a gamble for me. It just shows how far these systems still are from actual, grounded intelligence. And how much human common sense and nuance still matter. I am trying so hard to steer clear of taking this chance to make a human vs. AI case. I sure won’t use it to preach “Why I Hate AI and You Should Too.” But there is a problem that needs to be looked into.

As the study’s lead author, Rohit Saxena, put it, these weaknesses “must be addressed if AI systems are to be successfully integrated into time-sensitive real-world applications.”

Piyasa Fırsatı
Sleepless AI Logosu
Sleepless AI Fiyatı(AI)
$0.03476
$0.03476$0.03476
-5.46%
USD
Sleepless AI (AI) Canlı Fiyat Grafiği
Sorumluluk Reddi: Bu sitede yeniden yayınlanan makaleler, halka açık platformlardan alınmıştır ve yalnızca bilgilendirme amaçlıdır. MEXC'nin görüşlerini yansıtmayabilir. Tüm hakları telif sahiplerine aittir. Herhangi bir içeriğin üçüncü taraf haklarını ihlal ettiğini düşünüyorsanız, kaldırılması için lütfen [email protected] ile iletişime geçin. MEXC, içeriğin doğruluğu, eksiksizliği veya güncelliği konusunda hiçbir garanti vermez ve sağlanan bilgilere dayalı olarak alınan herhangi bir eylemden sorumlu değildir. İçerik, finansal, yasal veya diğer profesyonel tavsiye niteliğinde değildir ve MEXC tarafından bir tavsiye veya onay olarak değerlendirilmemelidir.

Ayrıca Şunları da Beğenebilirsiniz

Ethereum unveils roadmap focusing on scaling, interoperability, and security at Japan Dev Conference

Ethereum unveils roadmap focusing on scaling, interoperability, and security at Japan Dev Conference

The post Ethereum unveils roadmap focusing on scaling, interoperability, and security at Japan Dev Conference appeared on BitcoinEthereumNews.com. Key Takeaways Ethereum’s new roadmap was presented by Vitalik Buterin at the Japan Dev Conference. Short-term priorities include Layer 1 scaling and raising gas limits to enhance transaction throughput. Vitalik Buterin presented Ethereum’s development roadmap at the Japan Dev Conference today, outlining the blockchain platform’s priorities across multiple timeframes. The short-term goals focus on scaling solutions and increasing Layer 1 gas limits to improve transaction capacity. Mid-term objectives target enhanced cross-Layer 2 interoperability and faster network responsiveness to create a more seamless user experience across different scaling solutions. The long-term vision emphasizes building a secure, simple, quantum-resistant, and formally verified minimalist Ethereum network. This approach aims to future-proof the platform against emerging technological threats while maintaining its core functionality. The roadmap presentation comes as Ethereum continues to compete with other blockchain platforms for market share in the smart contract and decentralized application space. Source: https://cryptobriefing.com/ethereum-roadmap-scaling-interoperability-security-japan/
Paylaş
BitcoinEthereumNews2025/09/18 00:25
USD/INR opens flat on hopes of RBI’s follow-through intervention

USD/INR opens flat on hopes of RBI’s follow-through intervention

The post USD/INR opens flat on hopes of RBI’s follow-through intervention appeared on BitcoinEthereumNews.com. The Indian Rupee (INR) opens on a flat note against
Paylaş
BitcoinEthereumNews2025/12/18 13:33
A Netflix ‘KPop Demon Hunters’ Short Film Has Been Rated For Release

A Netflix ‘KPop Demon Hunters’ Short Film Has Been Rated For Release

The post A Netflix ‘KPop Demon Hunters’ Short Film Has Been Rated For Release appeared on BitcoinEthereumNews.com. KPop Demon Hunters Netflix Everyone has wondered what may be the next step for KPop Demon Hunters as an IP, given its record-breaking success on Netflix. Now, the answer may be something exactly no one predicted. According to a new filing with the MPA, something called Debut: A KPop Demon Hunters Story has been rated PG by the ratings body. It’s listed alongside some other films, and this is obviously something that has not been publicly announced. A short film could be well, very short, a few minutes, and likely no more than ten. Even that might be pushing it. Using say, Pixar shorts as a reference, most are between 4 and 8 minutes. The original movie is an hour and 36 minutes. The “Debut” in the title indicates some sort of flashback, perhaps to when HUNTR/X first arrived on the scene before they blew up. Previously, director Maggie Kang has commented about how there were more backstory components that were supposed to be in the film that were cut, but hinted those could be explored in a sequel. But perhaps some may be put into a short here. I very much doubt those scenes were fully produced and simply cut, but perhaps they were finished up for this short film here. When would Debut: KPop Demon Hunters theoretically arrive? I’m not sure the other films on the list are much help. Dead of Winter is out in less than two weeks. Mother Mary does not have a release date. Ne Zha 2 came out earlier this year. I’ve only seen news stories saying The Perfect Gamble was supposed to come out in Q1 2025, but I’ve seen no evidence that it actually has. KPop Demon Hunters Netflix It could be sooner rather than later as Netflix looks to capitalize…
Paylaş
BitcoinEthereumNews2025/09/18 02:23