FineVideo: A fine-grained dataset for video understanding. arXiv preprint arXiv:2405.00000

· 2024 · arXiv 2405.00000

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

DAIN: Dynamic Agent-Based Interaction Network for Efficient and Collaborative Multimodal Reasoning

cs.CL · 2026-06-29 · unverdicted · novelty 6.0

DAIN reframes multimodal fusion as dynamic agent collaboration with sparse activation, claiming SOTA results including 2.6% accuracy gain on ADNI across five benchmarks.

Decoupled Residual Quantization for Robust Semantic IDs in Recommendation

cs.IR · 2026-06-01 · unverdicted · novelty 6.0

Presents a diagnostic framework for semantic ID tokenizer failures using overlap and capacity metrics and proposes DRQ to separate geometry from distribution matching.

LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs

cs.CV · 2026-05-17 · unverdicted · novelty 6.0

LiteFrame is an efficient vision encoder backbone trained with Compressed Token Distillation and Language Model Adaptation to scale frame count in Video LLMs while cutting latency and raising accuracy.

Uncovering Hidden Systematics in Neural Network Models for High Energy Physics

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

Neural networks for HEP tasks can be fooled at significant rates by subtle perturbations inside uncertainty envelopes, revealing hidden systematics not captured by conventional methods.

SGTO-MAS: Secure Gorilla Troops Optimization for Multi-Agent LLM Systems

cs.CR · 2026-06-06 · unverdicted · novelty 3.0

SGTO-MAS applies Gorilla Troops Optimization to formulate multi-agent LLM coordination as a constrained optimization problem, reporting average performance of 0.5281, consensus 0.8764, risk 0.3000, and 4.04 agents selected across 500 runs.

citing papers explorer

DAIN: Dynamic Agent-Based Interaction Network for Efficient and Collaborative Multimodal Reasoning · cs.CL · 2026-06-29 · unverdicted · none · ref 70
Decoupled Residual Quantization for Robust Semantic IDs in Recommendation · cs.IR · 2026-06-01 · unverdicted · none · ref 2
LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs · cs.CV · 2026-05-17 · unverdicted · none · ref 13
Uncovering Hidden Systematics in Neural Network Models for High Energy Physics · cs.LG · 2026-05-08 · unverdicted · none · ref 14
SGTO-MAS: Secure Gorilla Troops Optimization for Multi-Agent LLM Systems · cs.CR · 2026-06-06 · unverdicted · none · ref 9
