AI Fails to Analyze Sports: Models Struggle with Reasoning

Researchers found that leading AI models like ChatGPT and Gemini performed poorly when tasked with analyzing sports footage, demonstrating limitations beyond basic observation. The study highlights a gap between AI's descriptive capabilities and its ability to understand context and predict outcomes.

June 6, 2026By ExstarHub Team

The hype around artificial intelligence continues to meet reality – sometimes with a resounding thud. Researchers recently put leading AI models like ChatGPT and Google’s Gemini through their paces analyzing basketball, soccer, and hockey, and the results were surprisingly underwhelming: they largely choked.

The SVI-Bench Test: A New Measure of AI Understanding

To assess AI’s capabilities beyond simple perception, researchers at the University of North Carolina and Northeastern University developed a new benchmark called Strategic Video Intelligence (SVI-bench). This wasn’t just about checking if an AI could identify a player; it aimed to evaluate its ability to understand *why* plays unfolded as they did. The test encompassed an enormous dataset – 35,000 hours of sports footage, 15 million annotated plays representing professional games, detailed analysis from experts, post-game reports, and extensive statistical records. The goal was to rigorously evaluate AI’s abilities in perception, reasoning, simulation, and agency—core traits that are proving difficult to assess with existing methods.

AI Can Spot Players, But Not Understand the Game

While AI showed reasonable success (around 74%) in identifying which player performed a specific action – essentially “eyeballing” what was happening on screen – that’s where its competence largely ended. This level of accuracy, while seemingly adequate for basic observation, would be considered insufficient even for amateur commentators. Causal reasoning—explaining *why* plays unfolded as they did—saw accuracy plummet to just below 50 percent. For instance, when researchers presented the models with a perplexing play where Cody Martin’s three-pointer bounced off the top of the backboard before landing in the bucket, ChatGPT incorrectly attributed it to “his first made three of the game,” demonstrating a complete lack of understanding of the unusual physics involved.

Predicting the Future: A Guessing Game

Simulation – predicting player movement and future events—proved even more challenging. The best-performing models were functionally no better than a coin flip when forecasting player trajectories, further diminishing with attempts to predict longer sequences toward goals or baskets. As researcher Lorenzo Torresani put it, “AI cannot tell you why things happen, and it cannot tell you what’s gonna happen next.” This lack of predictive capability goes beyond simply calculating ballistics; it indicates an inability to account for nuanced factors like player psychology, team strategy, and subtle shifts in momentum.

Why It Matters: Beyond the Bench

This isn’t just about sports broadcasting; it’s a broader reflection of AI’s limitations. The difficulty AI has in understanding context, anticipating outcomes, and making judgments mirrors challenges across many professions that require critical thinking and nuanced analysis—industries often touted for potential automation. The study’s findings carry significant implications: if AI struggles to grasp the intricacies of professional sports, its application to more complex fields like finance or healthcare could be similarly problematic without substantial advancements. The relentless narrative surrounding AI-driven job displacement might need recalibration; at least until AI can move beyond superficial observation and develop a deeper understanding of the world.

Key takeaways

AI struggles with causal reasoning and predictive capabilities in complex scenarios like sports analysis.
Current AI models primarily excel at identifying actions rather than understanding the underlying reasons or anticipating future events.
The limitations observed in sports analysis likely extend to other knowledge-based professions reliant on critical thinking.
Developing true AI proficiency requires more than just pattern recognition; it demands contextual understanding and predictive ability.
This study provides a valuable benchmark for assessing the progress of AI beyond superficial observation.

FAQ

What is SVI-bench?

SVI-bench (Strategic Video Intelligence) is a new testing framework created by researchers to evaluate how well AI models can understand and analyze complex scenarios, using sports footage as a test case.

Which AI models were tested?

The study evaluated several leading AI models including ChatGPT, Google’s Gemini, and the open-source model Qwen.

While sportscasters can breathe a sigh of relief for now, this analysis offers a much-needed dose of realism regarding the current state of AI capabilities and underscores the importance of human expertise in even seemingly data-rich fields.

Source: Futurism

The SVI-Bench Test: A New Measure of AI Understanding

AI Can Spot Players, But Not Understand the Game

Predicting the Future: A Guessing Game

Why It Matters: Beyond the Bench

Key takeaways

FAQ

What is SVI-bench?

Which AI models were tested?

Related Stories

Anthropic Reviews Cyber Tests After Claude Reached Real Systems

OpenAI Says Its Models Accessed Other Accounts During a Cyber Evaluation

Claude Shared Links Can Surface in Google: What to Review

Leave a Reply Cancel reply