archive
Every paper Pith has read. Search by title, abstract, or pith.
317 papers in eess.IV · page 1
-
FaSST matches LFNST gains at 84 percent lower complexity
FaSST: Fast Sparsifying Secondary Transform
-
Sonification lifts eye surgery event detection from 61 to 83 percent
Physics-Based iOCT Sonification for Real-time Interaction Awareness in Subretinal Injection
-
One model unifies filtering, smoothing and reanalysis via diffusion
ForcingDAS: Unified and Robust Data Assimilation via Diffusion Forcing
-
Keyed nonlinear transform cuts re-identification AUC 36% in medical split inference
Keyed Nonlinear Transform: Lightweight Privacy-Enhancing Feature Sharing for Medical Image Analysis
-
Implicit depth and Beer-Lambert law restore underwater images with 0.9M parameters
An Underwater Dehazing Network with Implicit Transmission Estimation
-
Diffusion model generates one-shot fluence maps for VMAT plans
Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based Optimization
-
Region selection among neural codecs reaches HEVC rates at single-codec cost
Spatial Competition for Low-Complexity Learned Image Compression
-
Bézier vessel encoding shifts disease predictions dose-responsively
A General B\'ezier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease Analysis
-
2D and 3D MRI models need different regularization for sparse data
Optimization in Sparse 2D to Dense 3D Weakly Supervised Learning: Application to Multi-Label Segmentation of Large ex vivo MRI Data
-
Stereo event cameras track 3D hand poses at 30 mm error
EgoEV-HandPose: Egocentric 3D Hand Pose Estimation and Gesture Recognition with Stereo Event Cameras
-
CycleGAN turns standard CT scans into usable low-dose training data
A Comparative Analysis of CT Degradation for LDCT Nodule Classification using Radiomics
-
Some frozen WSI-MIL models localize predictions to far fewer tiles
Are Compact Rationales Free? Measuring Tile Selection Headroom in Frozen WSI-MIL
-
Graph attention counts floors in street-view facades
GATA2Floor: Graph attention for floor counting in street-view facades
-
Transformer boosts UAV image PSNR by 5.7 dB with privacy
On Privacy-Preserving Image Transmission in Low-Altitude Networks: A Swin Transformer-Based Framework with Federated Learning
-
Radiomics guide diffusion model for label-free lung CT segmentation
DiffSegLung: Diffusion Radiomic Distillation for Unsupervised Lung Pathology Segmentation
-
Joint optimization raises low-field MRI quality without extra time
NexOP: Joint Optimization of NEX-Aware k-space Sampling and Image Reconstruction for Low-Field MRI
-
Calibrated adversarial stain aug reaches 93.9% on unseen slides
Physics-Grounded Adversarial Stain Augmentation with Calibrated Coverage Guarantees
-
Distillation across CT windows lifts AUC by 10-16 points
Uncovering Latent Pathological Signatures in Pulmonary CT via Cross-Window Knowledge Distillation
-
Frequency modules lift transformer accuracy on 3D medical scans
FEFormer: Frequency-enhanced Vision Transformer for Generic Knowledge Extraction and Adaptive Feature Fusion in Volumetric Medical Image Segmentation
-
Lightweight CNN hits 99 percent accuracy on brain tumor MRI scans
Brain Tumor Classification in MRI Images: A Computationally Efficient Convolutional Neural Network
-
Co-learning refines noisy labels in split federated medical segmentation
SplitFed-CL: A Split Federated Co-Learning Framework for Medical Image Segmentation with Inaccurate Labels
-
Fine-tuned language-vision model reaches 98% on SAR targets
Towards a Large Language-Vision Question Answering Model for MSTAR Automatic Target Recognition
-
Dataset turns satellite construction images into 2.3 million VQA examples
Geospatial-Temporal Sensemaking of Remote Sensing Activity Detections with Multimodal Large Language Model
-
One network registers cardiac MRI of any length or contrast
Set-Based Groupwise Registration for Variable-Length, Variable-Contrast Cardiac MRI
-
Online SAR processor focuses images line by line in 16 ms
Learning to Focus Synthetic Aperture Radar On-line with State-Space Models
-
Ray tracing lets microwave imaging see hidden targets
Polarization-Aware Ray-Tracing Enhanced Back-Projection Algorithm for Microwave Imaging in Complex Multipath Environments
-
Generative priors stable only in select imaging inverse problems
A Stability Benchmark of Generative Regularizers for Inverse Problems
-
Tube packages stabilize video recovery faster in semantic HARQ
Tube-Structured Incremental Semantic HARQ for Generative Video Receivers
-
Curated synthetic images boost real pose baselines at low cost
A Real-Calibrated Synthetic-First Data Engine
-
Jacobian metric selects tiny U-Nets at initialization
XTinyU-Net: Training-Free U-Net Scaling via Initialization-Time Sensitivity
-
Sensitivity metric selects 400x smaller U-Nets without training
XTinyU-Net: Training-Free U-Net Scaling via Initialization-Time Sensitivity
-
The paper describes a computational framework that reconstructs moving heart geometries…
Image-Based Whole-Heart Cardiac Flow Simulations in Health and Congenital Heart Disease
-
AI detects fetal brain bleeds without labeled scans
Annotation-free deep learning for detection and segmentation of fetal germinal matrix-intraventricular hemorrhage in brain MRI
-
Multi-layer CLIP similarities predict machine image preferences
ML-CLIPSim: Multi-Layer CLIP Similarity for Machine-Oriented Image Quality
-
Color-adaptive scheme raises 3D Gaussian streaming quality 5-20 dB
CAGS: Color-Adaptive Volumetric Video Streaming with Dynamic 3D Gaussian Splatting
-
Cross-modal vector lifts DR grading to 87.5% accuracy
Cross-Modal Semantic-Enhanced Diffusion Framework for Diabetic Retinopathy Grading
-
Weak supervision tracks retinal gaze below 0.45 deg error
Establishing Robust Retinal Eye Tracking: A Weakly Supervised Algorithmic Framework
-
Joint diffusion and relaxation MRI removes echo-time bias in muscle scans
Combined Diffusion-Relaxation MRI to Assess Muscle Microstructure and Composition
-
Open tool unifies methane analysis from five satellites
HyGAS: an Open, Sensor-Agnostic Platform for Multi-Satellite Methane Plume Retrieval, Uncertainty Propagation, and Emission-Rate Estimation
-
Tanager-1 joins PRISMA and EnMAP in methane plume framework
Multi-Sensor Methane Mapping in a Unified Framework: Tanager-1 Integration and comparison to EnMAP and PRISMA
-
Gaussian splatting relights VP scenes by sampling LED backgrounds directly
Relightable Gaussian Splatting for Virtual Production Using Image-Based Illumination
-
Neural network adapts frame rate and resolution for better streamed graphics
Streaming of rendered content with adaptive frame rate and resolution
-
Network impairments cut surgical teleoperation success to 12%
VISTA: A Benchmark for Real-Time Video Streaming under Network Impairments in Surgical Teleoperation
-
Multimodal training anchors the eigengap to recover more modes from few samples
Anchoring the Eigengap: Cross-Modal Spectral Stabilization for Sample-Efficient Representation Learning
-
Thin clients stream interactive 3D Gaussian Splatting over HTTP/3
Thin-Client Interactive Gaussian Adaptive Streaming over HTTP/3
-
Token compression cuts tracker MACs by 21% at 0.4% accuracy cost
An Efficient Token Compression Framework for Visual Object Tracking
-
Masks raise attention faithfulness over 35% in vision models
CAMAL: Improving Attention Alignment and Faithfulness with Segmentation Masks
-
Federated quantum model detects early retinopathy privately
FQPDR: Federated Quantum Neural Network for Privacy-preserving Early Detection of Diabetic Retinopathy
-
MCMC over DeepSDF latents yields calibrated uncertainty for heart shapes
Uncertainty Quantification for Cardiac Shape Reconstruction with Deep Signed Distance Functions via MCMC methods
-
Distance transform on contours boosts self-supervised depth accuracy
Improved monocular depth prediction using distance transform over pre-semantic contours with self-supervised neural networks