Research2026-04-28
PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model
Source: Arxiv CS.AI
arXiv:2604.24443v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have demonstrated strong performance on textbook-style physics problems, yet they frequently fail when confronted with dynamic real-world scenarios that require temporal consistency and causal reasoning across frames. We...
arxivpapersreasoningvision