Benchmarking shows narrowcast communication and visual environment representations enable VLMs to reduce evacuation failure rates more effectively than broadcast or graph-based alternatives across varying map complexities and threat dynamics.
Crisissense-llm: Instruction fine-tuned large language model for multi-label social media text classification in disaster informatics.arXiv preprint arXiv:2406.15477
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
A survey categorizing scaling in LLM reasoning across input size, steps, rounds, training, and future directions, noting that scaling can negatively affect performance.
A survey synthesizing LLM and MM-LLM uses in transportation operations, mobility services, and decision support while noting challenges like data heterogeneity and real-time needs.
citing papers explorer
-
Guide Me Out: A Framework to Benchmark VLM Operators Communication in Crisis Scenarios
Benchmarking shows narrowcast communication and visual environment representations enable VLMs to reduce evacuation failure rates more effectively than broadcast or graph-based alternatives across varying map complexities and threat dynamics.
-
A Survey of Scaling in Large Language Model Reasoning
A survey categorizing scaling in LLM reasoning across input size, steps, rounds, training, and future directions, noting that scaling can negatively affect performance.
-
Large Language Models in Transportation Systems Management and Operations: From Text Reasoning to Multi-modal Decision Support
A survey synthesizing LLM and MM-LLM uses in transportation operations, mobility services, and decision support while noting challenges like data heterogeneity and real-time needs.