ALIGN applies an agentic vision-language framework with OCR, grid scanning, and 3-run geometric voting to reduce mean accident localization error from 10.9 km to 0.59 km on Bangla news and maps.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning
ALIGN applies an agentic vision-language framework with OCR, grid scanning, and 3-run geometric voting to reduce mean accident localization error from 10.9 km to 0.59 km on Bangla news and maps.