XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
8 Pith papers cite this work.
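The operation named in the title — replacing the real-valued dot products inside convolutions with XNOR and popcount over sign bits, then rescaling by the mean weight and input magnitudes — can be sketched in NumPy. This is a minimal illustration, not the paper's kernel; the function name `binary_dot` and the bit-packing scheme are assumptions:

```python
import numpy as np

def binary_dot(w, x):
    """XNOR-Net-style approximation of <w, x>: binarize both vectors to
    {-1, +1}, compute the dot product via XNOR + popcount on packed bits,
    then rescale by the mean magnitudes (alpha for w, beta for x).
    Illustrative sketch; zeros binarize to +1 here."""
    n = len(w)
    wb = np.packbits(w >= 0)           # sign bits, 1 = non-negative
    xb = np.packbits(x >= 0)
    xnor = ~(wb ^ xb)                  # bit set where the two signs agree
    agree = sum(bin(int(b)).count("1") for b in xnor)  # popcount
    agree -= wb.size * 8 - n           # drop padding bits (both pad with 0, so they "agree")
    dot_pm1 = 2 * agree - n            # agreements minus disagreements
    return dot_pm1 * np.abs(w).mean() * np.abs(x).mean()
```

In the paper this structure lets a convolution run almost entirely in bitwise operations: weights are binarized once, activations per input, and only the two scalar scaling factors remain floating point.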
representative citing papers
-
Measuring What Matters Beyond Text: Evaluating Multimodal Summaries by Quality, Alignment, and Diversity
MM-Eval unifies evaluation of multimodal summaries by integrating factual text quality, cross-modal relevance via an MLLM judge, and visual diversity via truncated CLIP entropy, then calibrates their combination on human preferences.
-
MIRL: Mutual Information-Guided Reinforcement Learning for Vision-Language Models
MIRL uses mutual information to guide trajectory selection and to provide separate rewards for visual perception in RLVR for VLMs, achieving 70.22% average accuracy with 25% fewer full trajectories.
-
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
LLM.int8() performs 8-bit inference for transformers with up to 175B parameters and no accuracy loss by combining vector-wise quantization for most features with 16-bit mixed-precision handling of systematic outlier dimensions.
-
Towards Visually Grounded Multimodal Summarization via Cross-Modal Transformer and Gated Attention
SPeCTrA-Sum uses hierarchical cross-modal fusion via DVP and DPP-distilled image selection via VRP to generate more accurate and visually grounded multimodal summaries.
-
Hidden in Plain Sight: Visual-to-Symbolic Analytical Solution Inference from Field Visualizations
ViSA-R2 recovers single executable SymPy expressions for linear steady-state fields from visualizations using a self-verifying chain-of-thought that recognizes patterns, hypothesizes solution families, derives parameters, and checks consistency.
-
Quantization robustness from dense representations of sparse functions in high-capacity kernel associative memory
KLR Hopfield networks exhibit robustness to quantization but sensitivity to pruning, interpreted as arising from dense bimodal parameterization of sparse input mappings.
-
Gated-SwinRMT: Unifying Swin Windowed Attention with Retentive Manhattan Decay via Input-Dependent Gating
Gated-SwinRMT unifies Swin windowed attention with retentive Manhattan decay via input-dependent gating, reaching 80.22% top-1 accuracy on Mini-ImageNet versus 73.74% for the RMT baseline.
-
Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification
A DenseNet201 base model trained on a constructed plant leaf disease dataset outperforms baselines and enables faster, more robust transfer learning with less data than general models.
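The vector-wise-plus-outliers scheme described in the LLM.int8() entry above can be sketched in NumPy. This is a toy illustration under assumed names (`mixed_int8_matmul`, `threshold`); the actual implementation is the bitsandbytes library running fp16/int8 kernels on GPU:

```python
import numpy as np

def mixed_int8_matmul(X, W, threshold=6.0):
    """Toy sketch of the LLM.int8() decomposition: feature dimensions of X
    that contain an outlier (|x| > threshold) bypass quantization; the rest
    use vector-wise int8 quantization with per-row / per-column scales."""
    outlier = (np.abs(X) > threshold).any(axis=0)   # outlier feature dims
    regular = ~outlier
    Xr, Wr = X[:, regular], W[regular, :]
    # Vector-wise scales: one per row of X, one per column of W.
    sx = np.maximum(np.abs(Xr).max(axis=1, keepdims=True), 1e-12) / 127.0
    sw = np.maximum(np.abs(Wr).max(axis=0, keepdims=True), 1e-12) / 127.0
    Xq = np.round(Xr / sx).astype(np.int8)
    Wq = np.round(Wr / sw).astype(np.int8)
    # int8 x int8 accumulated in int32, dequantized by the outer product of scales.
    Y = (Xq.astype(np.int32) @ Wq.astype(np.int32)) * (sx * sw)
    # Outlier dimensions stay in higher precision (fp16 in the paper).
    return Y + X[:, outlier] @ W[outlier, :]
```

Separating the outlier dimensions matters because a handful of systematically large activation features would otherwise inflate the quantization scales and destroy the resolution available to the remaining ~99.9% of values.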