Spatial-Temporal Reasoning Examples

Breakthrough AI system helps self-driving cars remember the road

A self-driving car moves through traffic one moment at a time. A bus blocks part of the road. Rain throws reflections across ...

IEEE

Spatial-Temporal Clue Reasoning Chain for Long Video Question Answering

Abstract: Existing Video Question Answering (VideoQA) methods face tremendous challenges when dealing with longer videos. On the one hand, long videos contain rich and diverse information at different ...

GitHub

SprBull/nvidia-cosmos-cosmos-reason1

NVIDIA Cosmos Reason – an open, customizable, 7B-parameter reasoning vision language model (VLM) for physical AI and robotics - enables robots and vision AI agents to reason like humans, using prior ...

Frontiers

Time in mind: a multidisciplinary review on temporal perception, cognition, and memory

This review examines temporal cognition through the lens of Mental Time Travel (MTT): the subjective experience of recalling past events and using them to construct future scenarios. The analysis ...

Scientific Research Publishing

Burkholder, E.F. (2018) Example Projects-Chapter 15, The 3D Global Spatial Data Model: Principles and Applications. 2nd Edition, CRC Press.

ABSTRACT: Since about 1970, Geographic Information Systems (GISs) have been implemented as a tool to organize spatial data related to locations on or near the surface of the Earth. As technology ...

VentureBeat

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...

GitHub

R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios

Recently, rapid advancements have been made in multimodal large language models (MLLMs), especially in video understanding tasks. However, current research focuses on simple video scenarios, failing ...

TechCrunch

Gemini 3 refused to believe it was 2025, and hilarity ensued

Every time you hear a billionaire (or even a millionaire) CEO describe how LLM-based agents are coming for all the human jobs, remember this funny but telling incident about AI’s limitations: Famed AI ...

Forbes

Robot Replacements, Galaxy XR, $133 Million Raised By General Intuition, ChatGPT’s Atlas Browser

Forbes contributors publish independent expert analyses and insights. A former tech executive covering AI and XR for Forbes. I have been doing a lot of research into the effects of AI on media, ...

GeekWire

Startup Radar: Seattle founders build AI tools for leadership training, spatial reasoning, vibe coding

GeekWire chronicles the Pacific Northwest startup scene. Sign up for our weekly startup newsletter, and check out the GeekWire funding tracker and VC directory. by Taylor Soper on Oct 6, 2025 at 12:55 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results