super hub Canonical reference

In: Proceedings of the IEEE/CVF Conference on Computer 25 Vision and Pattern Recognition, pp

Abdal, Peter, Rameen and Qin, year=, Yipeng and Wonka · 2020 · arXiv 2600.2020

Canonical reference. 71% of citing Pith papers cite this work as background.

194 Pith papers citing it

Background 71% of classified citations

read on arXiv browse 194 citing papers more from Abdal

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 32 dataset 9 method 5 baseline 3

citation-polarity summary

background 35 use dataset 6 use method 5 baseline 3

authors

Abdal Peter Rameen and Qin year= Yipeng and Wonka

co-cited works

representative citing papers

Rolling Shutter Relative Pose Estimation Made Practical

cs.CV · 2026-06-25 · conditional · novelty 8.0

A linearized solver estimates rolling-shutter relative pose and motion from 7 affine correspondences in 1.2 ms and reports best-in-benchmark accuracy plus usable translational velocity.

WildBox: A Dataset and Benchmark for Aerial Monocular 3D Detection of African Savanna Wildlife

cs.CV · 2026-06-19 · unverdicted · novelty 8.0

WildBox provides over 237k 3D wildlife annotations from drone video and benchmarks reveal zero-shot 3D detection at 0 AP but fine-tuned performance of 8.68 AP-BEV and 13.17 AP3D, with depth estimation causing most errors.

Mind2Web: Towards a Generalist Agent for the Web

cs.CL · 2023-06-09 · accept · novelty 8.0

Mind2Web is the first large-scale dataset of real-world web tasks for developing generalist language-guided agents that complete complex actions on diverse websites.

Learning Spectral and Polarimetric Clues for One-to-Multimodal Novel View Synthesis

cs.CV · 2026-07-02 · unverdicted · novelty 7.0

SPoILeR uses multimodal pre-training to enable accurate novel view synthesis of infrared, polarimetric, and multispectral data from RGB-supervised fine-tuning on new scenes.

MoHallBench: A Benchmark for Motion Hallucination in Video Large Language Models

cs.CV · 2026-07-01 · unverdicted · novelty 7.0

MoHallBench is a new benchmark evaluating motion hallucination in VideoLLMs from co-occurrence priors, sequential inference, and similarity confusion, revealing decoupling from action recognition performance.

PRISM-VO: Scale-Aware Visual Odometry Using Photometric Plenoptic Bundle Adjustment

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

PRISM-VO introduces photometric plenoptic bundle adjustment for drift-resilient, metric-scale visual odometry from a single focused plenoptic camera.

Learning to Deny: Action Denial in Multimodal Large Language Models

cs.CV · 2026-06-30 · unverdicted · novelty 7.0 · 3 refs

MLLMs drop from over 85% accuracy on action presence to under 50% on matched action-denial videos, exposing a causal verification gap that causal graph prompts partially close.

Diffusion-Based Material Regularization for Physics-Based Inverse Rendering

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

A regularization technique that treats diffusion model outputs as a similarity kernel during material optimization in inverse rendering, enabling joint reconstruction of geometry, materials, and illumination that satisfies the rendering equation and generalizes to new lighting.

HASTE: A Framework for Training-Free, Dynamic, and Steerable Compression of Pre-Trained Convolutional Neural Networks

cs.CV · 2026-06-29 · unverdicted · novelty 7.0 · 2 refs

HASTE enables training-free dynamic compression of pre-trained CNNs by patch-wise LSH-based merging of redundant channels, reporting 46.2% FLOPs reduction on ResNet34 CIFAR-10 with 1.25% accuracy drop.

AirGroundBench: Probing Spatial Intelligence in Multimodal Large Models under Heterogeneous Multi-View Embodied Collaboration

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

AirGroundBench is a new diagnostic benchmark exposing that MLLMs handle basic spatial perception but struggle with cross-view alignment, transformation reasoning, and embodied navigation under heterogeneous air-ground views.

ScaLe-INR: Scale and Learn Implicit Neural Representations

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

ScaLe-INR is a multi-branch INR architecture that applies directional scaling per the Fourier inverse theorem and a directional edge guidance loss to disentangle scales and improve reconstruction fidelity.

Semantic Browsing: Controllable Diversity for Image Generation

cs.CV · 2026-06-22 · unverdicted · novelty 7.0

A technique for controllable diversity in text-to-image generation by inducing structured semantic variations at the prompt level via VLM and agentic workflow.

4DVLT: Dynamic Scene Understanding with Worldline-Centered Vision-Language Tracking

cs.CV · 2026-06-21 · conditional · novelty 7.0 · 2 refs

The paper defines the 4DVLT task for worldline-centered 4D scene understanding, releases Instruct-4D with 129.4K QA pairs, and presents 4DTrack achieving 62.68 TGA_Top1, outperforming adapted baselines by 19.62 points.

Deep Unrolled Networks in Representation Space Applied to MRI Reconstruction

eess.IV · 2026-06-19 · unverdicted · novelty 7.0

DUNE enables exact data-consistency gradients via VJP when deep unrolled networks operate in representation space, yielding better MRI reconstructions than prior heuristic-DC variants.

Does Text Actually Help? Uncovering and Resolving Text Collapse in Multimodal Time Series Forecasting

cs.LG · 2026-06-17 · unverdicted · novelty 7.0

REST-TS resolves text collapse in multimodal time series forecasting by exclusively supervising the text branch on numerical residuals to compel genuine content extraction from text descriptions.

Human Universal Grasping

cs.RO · 2026-06-15 · unverdicted · novelty 7.0

HUG trains a flow-matching model on a new 1M-frame egocentric human grasp dataset to generate retargetable grasps from single RGB-D images, beating baselines by 23-34% on a new 90-object benchmark.

iSAGE: A Human-in-the-Loop Framework for Remote Sensing Semantic Segmentation via Sparse Point Supervision

cs.CV · 2026-06-08 · unverdicted · novelty 7.0

iSAGE achieves near-dense mIoU performance in remote sensing semantic segmentation using iterative expert clicks on confident model errors with an error-weighted loss, using only 0.011-0.04% of pixels.

Targeting World Models to Compromise Robot Learning Pipelines

cs.RO · 2026-06-08 · unverdicted · novelty 7.0

World models introduce a stealthy poisoning vector into robot learning pipelines where malicious prompts or dynamics in teleoperated data activate only during synthetic trajectory generation, enabling backdoors in downstream policies.

Bridging CAD and Data-Driven Design: Attributed Feature Graphs for Engineering Design

cs.CE · 2026-06-04 · unverdicted · novelty 7.0

Attributed Feature Graphs (AFGs) represent CAD features as attributed nodes and relations as directed edges to enable GNN surrogate models that predict design performance with feature-level interpretability on the CarHoods10K dataset.

LL-Bench: Rethinking Low-Level Vision Evaluation in the Era of Large-Scale Generative Models

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

LL-Bench supplies a human-annotated dataset exposing generative model weaknesses in low-level restoration and introduces LL-Score as an MLLM evaluator that outperforms existing quality metrics and can serve as a training reward.

DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution

physics.chem-ph · 2026-06-01 · unverdicted · novelty 7.0

DPA4 is a new SE(3)-equivariant interatomic potential with EMFA SO(2) convolution that sets new accuracy-cost records on Matbench Discovery and SPICE benchmarks using fewer parameters than prior models.

Quality-Guided Semi-Supervised Learning for Medical Image Segmentation

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

A new quality-guided approach for semi-supervised medical image segmentation that trains a predictor on synthetic errors to enhance pseudolabel handling.

DELOS: Detecting Shallow Transits in Kepler Photometry Using a Contrastive-Learning Framework

astro-ph.EP · 2026-05-28 · conditional · novelty 7.0 · 2 refs

DELOS applies contrastive learning to phase-folded light curves to detect shallow intermediate-to-long period transits, reporting 15.5% and 11.25% gains in combined precision-recall over BLS and TLS in low-SNR tests plus 3-80x speedups.

The Abstraction Gap in Vision-Language Causal Reasoning

cs.CL · 2026-05-27 · unverdicted · novelty 7.0

Introduces Abstraction Gap metric and CAGE benchmark showing seven of eight VLMs have large gaps between text plausibility and chain-based causal reasoning, with one model succeeding.

citing papers explorer

Showing 50 of 194 citing papers.

Beyond Pixel Overlap: A Framework for Decomposing Segmentation Evaluation Metrics cs.CV · 2026-07-01 · unverdicted · none · ref 5 · 2 links
A framework decomposes segmentation evaluation metrics into five modular stages to expose assumptions and compare design choices across metrics.
GenSP: Consistent Spherical Parameterization via Learning Shape Generative Models cs.CV · 2026-07-01 · unverdicted · none · ref 18
GenSP learns a continuous neural deformation model from sphere coordinates and latent codes to produce consistent spherical parameterizations for genus-0 shapes.
FLORA: A deep learning approach to predict forest attributes from heterogeneous LiDAR data cs.CV · 2026-06-30 · unverdicted · none · ref 32
FLORA is an octree-based deep learning framework with auxiliary data fusion that predicts forest attributes from heterogeneous LiDAR, achieving rRMSE of 12.3% for dominant height and 39% for total volume on 32k French NFI plots.
PGUDA: Pressure-Guided Unsupervised Domain Adaptation with Cross-Modal Knowledge Distillation for sEMG-Based Gesture Recognition eess.SP · 2026-06-30 · unverdicted · none · ref 39
PGUDA uses pressure signals to train a teacher network that distills modality-invariant knowledge into an sEMG student via cross-modal distillation, reaching 58.08% cross-subject accuracy with only 5% labeled data for the teacher.
Fleet: Few Shots Lead Effective AI-generated Image Detection cs.CV · 2026-06-30 · unverdicted · none · ref 17
Fleet achieves dynamic few-shot adaptation for AIGI detection via avoidance routing in decoupled subspaces, raising accuracy from 20.4% to 73.1% on new generators like Doubao Seedream 4.0 with 10 shots.
RBE-Flow: Recurrent Bayesian Estimation on Feature Manifolds for Cross-Modal Registration cs.CV · 2026-06-29 · unverdicted · none · ref 39
RBE-Flow recasts dense cross-modal flow estimation as closed-loop recurrent Bayesian estimation on learned feature manifolds with uncertainty-adaptive updates and achieves SOTA on three registration benchmarks.
Self-Supervised Calibration of Scientific Instruments Using Physical Consistency Constraints cs.LG · 2026-06-28 · unverdicted · none · ref 8
A physics-informed self-supervised framework learns detector calibration parameters and ionic charge-state predictions jointly from raw spectrometer data using iterative pseudo-labelling driven by physical constraints.
Confidence-feedback-weighted graph matching network: online-offline laser-induced damage site matching under complex interference cs.CV · 2026-06-28 · unverdicted · none · ref 45
A confidence-feedback-weighted graph matching network achieves 96.36% F1-score on damage site matching by using matchability confidence to weight edge features and applying geometric consistency and hard-example mining.
Few-Step Boltzmann Generators via Scalable Likelihood Flow Maps cs.LG · 2026-06-27 · unverdicted · none · ref 6
SCALLOP replaces Hutchinson's trace estimator with a scalable, vectorized likelihood distillation objective for F2D2 flow maps, cutting training variance and time while improving performance on molecular Boltzmann generators and image data.
Estimation--Prediction Tradeoff in Causal Probabilistic Temporal Graphs cs.LG · 2026-06-26 · unverdicted · none · ref 177
Characterizes an estimation-prediction tradeoff in binary logistic models for causal probabilistic temporal graphs and proposes a framework to jointly evaluate temporal link prediction with causal parameter recovery via Cramér-Rao bounds.
A Comparison of Fusion Techniques for Multi-Modal Human Activity Recognition on the HARMES Dataset cs.LG · 2026-06-26 · conditional · none · ref 40
Gated Multi-modal Fusion reaches 0.82 macro F1 on HARMES, beating the concatenation baseline of 0.76 by 6 points under leave-one-participant-out evaluation.
Differential Unfolding: Efficient Unfolding Reconstruction for Video Snapshot Compressive Imaging cs.CV · 2026-06-23 · unverdicted · none · ref 40
Differential Unfolding replaces uniform stacking in deep unfolding networks with a heterogeneous structure of anchoring and differential evolution stages to achieve better accuracy-efficiency trade-offs in video SCI reconstruction.
From Pixels to Concepts: Growing Rich 3D Semantic Scene Graph Forests utilizing Foundation Models cs.RO · 2026-06-22 · unverdicted · none · ref 17
Uses VLMs to detect instance concepts and LLMs to infer abstract relationships, assembling them into 3D scene graph forests that are evaluated on uHumans2 and ScanNet and tested in open-vocabulary retrieval on a Spot robot.
Unmasking LAION-5B: Age, Gender, Race, and Emotion Biases in Large-Scale Image Datasets cs.CV · 2026-06-22 · unverdicted · none · ref 95
Empirical audit of LAION-2B-en and LAION-2B-multi finds overrepresentation of young adults, White people, and males plus stereotypical emotion associations across two attribute classifiers.
Interpretable Uncertainty Routing Separating Emotion Ambiguity from Distribution Shift in Facial Expression Recognition cs.CV · 2026-06-21 · unverdicted · none · ref 28
Uncertainty decomposition via deep ensembles separates annotator disagreement from distribution shift in FER, enabling a routing mechanism that retains 1.8x more ambiguous faces at matched OOD rejection compared to single-uncertainty baselines.
Venice-H1: Failure-Aware Query Re-Ranking with Multi-Scale Grid Signatures for Referring Image Segmentation cs.CV · 2026-06-21 · unverdicted · none · ref 16
Venice-H1 improves failure-case mIoU by 0.89-1.40 points in referring image segmentation via multi-scale grid signatures and a failure-aware re-ranker, with positive CIs on all tested pairs and low harmful-switch rates.
Cross-View Yaw Estimation in Location Uncertainty with Line-Aligning Yaw Scoring cs.CV · 2026-06-20 · unverdicted · none · ref 27
Introduces LAYS, a radially invariant line-consensus voting method achieving sub-degree yaw precision in cross-view localization independent of location accuracy.
TimeProVe: Propose, then Verify for Efficient Long Video Temporal Reasoning in Activities of Daily Living cs.CV · 2026-06-18 · unverdicted · none · ref 40 · 2 links
TimeProVe proposes a propose-then-verify framework using lightweight action-based candidate evidence generation followed by targeted VLM verification for efficient long video temporal reasoning, achieving 7.3% improvement on OTB with 75% fewer VLM calls.
Temporal Preference Optimization for Unsupervised Retrieval cs.IR · 2026-06-16 · unverdicted · none · ref 5
TPOUR uses a novel TRPO method to improve unsupervised retrievers for temporal relevance, outperforming baselines including a much larger model on nDCG@5 for explicit and implicit time queries.
Reinforcing Dual-Path Reasoning in Spatial Vision Language Models cs.CV · 2026-06-16 · unverdicted · none · ref 122
SR-REAL equips spatial VLMs with dual LOR and DTR reasoning paths trained via RL, achieving better benchmark performance through mutual reinforcement and generalization without per-task tuning.
SceneMiner: Identity-Preserving Multi-Task Fine-Tuning for Unified BEV Scene Mining cs.CV · 2026-06-09 · unverdicted · none · ref 4
SceneMiner shows that identity-preserving multi-task fine-tuning removes cross-task interference by zero-initializing new heads and freezing shared-stream parameters, enabling unified BEV scene mining with preserved original heads.
Hybrid Robustness Verification for Spatio-Temporal Neural Networks cs.CV · 2026-06-08 · unverdicted · none · ref 62
STBP computes exact closed-form bounds for the first convolutional layer of spatio-temporal networks and propagates scalable approximations through the rest to certify robustness under subset-frame or patch perturbations.
SOMA: From Surface Observations to Muscle Anatomy cs.CV · 2026-06-08 · unverdicted · none · ref 59
SOMA recovers spatio-temporal muscle behavior from multi-view RGB surface data and introduces the SKIM soft-tissue deformation dataset as the first such method from RGB observations.
VeriDrive: Verifiable Counterfactual Supervision for Cost-Efficient Vision-Language Planning cs.CV · 2026-06-05 · unverdicted · none · ref 26
VeriDrive introduces a verifiable counterfactual supervision framework using a Perception-Evaluation-Revision chain and validator-guided correction to generate cost-efficient structured data for vision-language driving models, showing metric gains on nuScenes.
DREAM: Dynamic Refinement of Early Assignment Mappings cs.IR · 2026-06-05 · unverdicted · none · ref 14
DREAM proposes intent-aware tokenization, frozen-model evaluation, and dynamic beams to refine early SID assignments and improve cold-start performance in generative recommenders on Amazon benchmarks.
COMAP: Co-Evolving World Models and Agent Policies for LLM Agents cs.AI · 2026-06-01 · unverdicted · none · ref 3
COMAP co-evolves textual world models and agent policies for LLMs through on-policy self-distillation, yielding up to 16.75% relative gains on embodied planning, web navigation, and tool-use tasks.
Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes cs.GR · 2026-05-28 · unverdicted · none · ref 1
Introduces Photographic Scene Graph and aesthetic-guided comparative planning to generate physically feasible and human-preferred portrait plans in 3D scenes.
Deep Psychovisual Image Representations cs.CV · 2026-05-28 · unverdicted · none · ref 32
Proposes a psychovisual-inspired deep learning method that encodes images in learned frequency sub-bands for interpretable semantic structures and reduced depth dependence.
MuNet: A Mutualistic Network for Joint 3D Human Mesh Recovery and 3D Clothed Human Reconstruction from Single Images cs.CV · 2026-05-25 · unverdicted · none · ref 7 · 3 links
MuNet is an end-to-end graph convolutional network using 2-manifold graphs and a mutualistic training mechanism that jointly optimizes 3D human mesh recovery and clothed reconstruction, reporting state-of-the-art results on six benchmarks.
ARCANE-PedSynth: Synthetic Multi-Pedestrian Datasets with Behavioural Crossing Annotations cs.RO · 2026-05-24 · unverdicted · none · ref 6 · 2 links
ARCANE-PedSynth is a CARLA-based framework that generates synthetic multi-pedestrian datasets with behavioral crossing annotations by using hybrid AI-manual control to raise crossing rates and a 12-state FSM for diverse behaviors.
MR-LiDAR: A Multi-Resolution Roadside LiDAR Benchmark for Perception Diagnostics and Deployment Guidance cs.RO · 2026-05-23 · unverdicted · none · ref 18 · 2 links
MR-LiDAR benchmark shows an 80-beam LiDAR with optimized distribution can match or exceed 128-beam uniform LiDAR for roadside vehicle and VRU detection.
HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models cs.AI · 2026-05-22 · unverdicted · none · ref 3
HyperGuide projects LLM hidden states into hyperbolic space to create a distance-to-origin signal for solution proximity and uses it to guide multi-step generation via a trained head and low-rank adapter.
Automatic Discovery of Disease Subgroups by Contrasting with Healthy Controls cs.LG · 2026-05-20 · conditional · none · ref 2 · 2 links
Deep UCSL uses a contrastive EM loss on patient-control labels to isolate disease-driven subgroups in medical imaging by suppressing shared healthy variability.
LiFT: Lifted Inter-slice Feature Trajectories for 3D Image Generation from 2D Generators cs.CV · 2026-05-18 · unverdicted · none · ref 21
LiFT factorizes 3D medical volume synthesis into per-slice 2D generation and inter-slice trajectory learning, using a tri-planar drifting loss for unconditional coherence and a z-context mixer for paired translation tasks.
3DTMDet: A Dual-Path Synergy Network of Transformer and SSM for 3D Object Detection in Point Clouds cs.CV · 2026-05-15 · unverdicted · none · ref 41
3DTMDet proposes a hybrid Mamba-Transformer architecture with a 3DHMT block and LiDAR-inspired voxel generation to improve 3D object detection in point clouds, outperforming prior methods on KITTI and ONCE datasets.
A General Differentiable Ray-Wave Framework for Hybrid Refractive-Diffractive System Modeling and Optimization physics.optics · 2026-05-14 · unverdicted · none · ref 130
A plug-and-play differentiable model bridging ray and wave optics for hybrid systems that enables end-to-end optimization of planar and conformal diffractive elements.
Deep Pre-Alignment for VLMs cs.CV · 2026-05-14 · unverdicted · none · ref 86
Deep Pre-Alignment uses a small VLM perceiver instead of ViT to pre-align visual features with LLM text space, yielding 1.9-3.0 point gains on multimodal benchmarks and 32.9% less language forgetting.
MULTI: Disentangling Camera Lens, Sensor, View, and Domain for Novel Image Generation cs.CV · 2026-05-12 · unverdicted · none · ref 4
MULTI uses two-stage textual inversion to disentangle camera lens, sensor, view, and domain factors for novel image generation, supporting dataset extension and ControlNet modifications on the new DF-RICO benchmark.
Self-organized MT Direction Maps Emerge from Spatiotemporal Contrastive Optimization q-bio.NC · 2026-05-12 · unverdicted · none · ref 13
Direction maps and pinwheel structures in MT emerge spontaneously when a spatiotemporal deep network is trained on videos with contrastive self-supervised learning and spatial regularization.
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery cs.CV · 2026-05-12 · unverdicted · none · ref 50 · 2 links
SkyPart achieves state-of-the-art single-pass cross-view geo-localization on SUES-200, University-1652, and DenseUAV by using prototype-based part discovery, altitude-conditioned modulation, and Kendall-weighted loss, with widening gains under weather corruptions.
Learning Point Cloud Geometry as a Statistical Manifold: Theory and Practice cs.RO · 2026-05-11 · unverdicted · none · ref 53
Point cloud geometry is cast as a statistical manifold of per-point Gaussians, with POLI learning the mapping self-supervisedly to improve perception without labeled data.
MAG-VLAQ: Multi-modal Aerial-Ground Query Aggregation for Cross-View Place Recognition cs.CV · 2026-05-10 · unverdicted · none · ref 5 · 2 links
MAG-VLAQ fuses multi-modal ground and aerial data via ODE-conditioned vector-of-locally-aggregated-queries to nearly double recall@1 on aerial-ground place recognition benchmarks.
Removing the Watermark Is Not Enough: Forensic Stealth in Generative-AI Watermark Removal cs.CR · 2026-05-09 · unverdicted · none · ref 9 · 2 links
Current AI image watermark removal attacks replace the watermark with a different forensic signal, allowing independent detectors to distinguish processed outputs from clean images at over 98% true-positive rate under a 1% false-positive budget.
Experience Sharing in Mutual Reinforcement Learning for Heterogeneous Language Models cs.LG · 2026-05-08 · unverdicted · none · ref 98
Mutual Reinforcement Learning allows heterogeneous LLMs to exchange experience through mechanisms like Peer Rollout Pooling, Cross-Policy GRPO Advantage Sharing, and Success-Gated Transfer, with outcome-level sharing identified as favorable on the stability-support trade-off.
Generalized Category Discovery in Federated Graph Learning cs.LG · 2026-05-05 · unverdicted · none · ref 31
GCD-FGL mitigates neighborhood absorption and global semantic inconsistency in federated generalized category discovery, delivering +4.86 average HRScore gain over baselines on five graph datasets.
QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization cs.LG · 2026-05-05 · unverdicted · none · ref 7
QuIDE defines the Intelligence Index I = (C × P) / log₂(T+1) as a unified score for the compression-accuracy-latency trade-off in quantized neural networks, with experiments showing task-dependent optimal bit widths.
SHIELD: A Diverse Clinical Note Dataset and Distilled Small Language Models for Enterprise-Scale De-identification cs.CL · 2026-05-05 · unverdicted · none · ref 6
SHIELD is a new diverse clinical note dataset paired with distilled small language models that achieve 0.89 span-level precision and 0.88 recall for on-premise PHI de-identification.
Model Merging: Foundations and Algorithms cs.LG · 2026-05-02 · unverdicted · none · ref 81
New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.
HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering cs.AI · 2026-04-22 · unverdicted · none · ref 115 · 2 links
HypEHR is a hyperbolic embedding model for EHR data that uses Lorentzian geometry and hierarchy-aware pretraining to answer clinical questions nearly as well as large language models but with much smaller size.
Where are they looking in the operating room? cs.CV · 2026-04-22 · unverdicted · none · ref 8
Gaze-following models on extended 4D-OR and Team-OR datasets reach F1 scores of 0.92 for clinical role prediction and 0.95 for surgical phase recognition while improving team communication detection by over 30%.

In: Proceedings of the IEEE/CVF Conference on Computer 25 Vision and Pattern Recognition, pp

hub tools

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer