Rethinking the Inception Architecture for Computer Vision

Christian Szegedy, Jon Shlens, Sergey Ioffe, Vincent Vanhoucke, Zbigniew Wojna · 2016 · 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) · DOI 10.1109/cvpr.2016.308

17 Pith papers cite this work, alongside 21,437 external citations. Polarity classification is still indexing.

17 Pith papers citing it

21.4k external citations · Crossref

open at publisher browse 17 citing papers more from Christian Szegedy

citation-role summary

background 4

citation-polarity summary

background 3 support 1

authors

Christian Szegedy Jon Shlens Sergey Ioffe Vincent Vanhoucke Zbigniew Wojna

co-cited works

representative citing papers

Understanding deep learning requires rethinking generalization

cs.LG · 2016-11-10 · accept · novelty 8.0

State-of-the-art convolutional networks easily memorize random labels and unstructured noise images, indicating that generalization in deep learning cannot be explained by traditional capacity or regularization arguments.

Quantifying Explanation Consistency: The C-Score Metric for CAM-Based Explainability in Medical Image Classification

cs.CV · 2026-04-09 · unverdicted · novelty 7.0

The C-Score quantifies intra-class explanation consistency for CAM methods via confidence-weighted pairwise soft IoU and detects AUC-consistency dissociation as an early warning for model instability on chest X-ray classification.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · novelty 7.0

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation

cs.LG · 2026-05-30 · unverdicted · novelty 6.0

Matching in semantic SSL feature space via Sinkhorn divergence enables effective one-step generation on ImageNet by inducing compact geometry for distribution matching, with training and evaluation features best kept distinct.

Flow Matching with Arbitrary Auxiliary Paths

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

AuxPath-FM extends flow matching to arbitrary auxiliary distributions while preserving the continuity equation and marginal training objective.

P-Guide: Parameter-Efficient Prior Steering for Single-Pass CFG Inference

cs.AI · 2026-05-07 · unverdicted · novelty 6.0

P-Guide achieves single-pass classifier-free guidance in flow matching by modulating the initial latent state and is equivalent to standard CFG under a first-order approximation while cutting latency by half.

Explicit Dropout: Deterministic Regularization for Transformer Architectures

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

Explicit dropout reformulates stochastic dropout as deterministic loss penalties for Transformers, matching or exceeding standard performance with independent control per component.

Rotary Masked Autoencoders are Versatile Learners

cs.LG · 2025-05-26 · unverdicted · novelty 6.0

RoMAE applies rotary positional embeddings to masked autoencoders to enable representation learning and interpolation on continuous positional data across irregular time-series, images, and audio without modality-specific modifications.

GAP3D: Generative Alignment of VLM Latents to Patch-Level Embeddings for 3D Generation

cs.CV · 2026-05-27 · unverdicted · novelty 5.0

GAP3D is a diffusion-based alignment technique that maps VLM latents to dense patch embeddings from image encoders, enabling modular VLM conditioning for 3D generation without 3D training data.

HeartBeatAI: An Interpretable and Robust Deep Learning Framework for Multi-Label ECG Arrhythmia Detection

cs.AI · 2026-05-23 · unverdicted · novelty 4.0

HeartBeatAI reports 98% Macro F1 under intra-source testing on four ECG datasets but shows significant degradation on rare anomalies under leave-one-domain-out evaluation.

CNNs for Vis-NIR Chemometrics: From Contradiction to Conditional Design

cs.LG · 2026-05-04 · unverdicted · novelty 4.0

Contradictions across CNN studies for Vis-NIR chemometrics are expected outcomes of uncontrolled variables in spectral physics and validation design, motivating a conditional rather than universal design framework.

A Compact and Efficient 1.251 Million Parameter Machine Learning CNN Model PD36-C for Plant Disease Detection: A Case Study

cs.CV · 2026-04-13 · unverdicted · novelty 4.0

PD36-C is a 1.25 million parameter CNN achieving 99.53% average test accuracy on 38 plant disease classes from the New Plant Diseases Dataset, with a Qt-based app enabling edge deployment.

AlphaQuanter: An End-to-End Tool-Augmented Agentic Reinforcement Learning Framework for Stock Trading

cs.CE · 2025-10-16 · unverdicted · novelty 4.0

AlphaQuanter introduces a single-agent tool-augmented RL framework for stock trading that learns dynamic policies over a transparent decision workflow and reports state-of-the-art financial metrics.

Explaining Machine Learning and Memorization with Statistical Mechanics

cs.LG · 2026-06-30 · unverdicted · novelty 3.0

Thesis uses statistical mechanics to study DAM and RBM models for understanding memorization, low-dimensional learning, and adversarial robustness in neural networks.

Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification

cs.CV · 2026-05-02 · unverdicted · novelty 3.0

A DenseNet201 base model trained on a constructed plant leaf disease dataset outperforms baselines and enables faster, more robust transfer learning with less data than general models.

CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection

cs.CV · 2026-05-25 · unverdicted · novelty 2.0

Benchmark of twelve models finds hybrid CNN-transformer architectures and a SigLIP vision-language model deliver the strongest overall performance on skin cancer detection using the PAD-UFES-20 dataset.

Semi supervised GAN for smart microscopy, fast and data efficient cell cycle classification

q-bio.QM · 2026-04-22

citing papers explorer

Showing 17 of 17 citing papers.

Understanding deep learning requires rethinking generalization cs.LG · 2016-11-10 · accept · none · ref 7
State-of-the-art convolutional networks easily memorize random labels and unstructured noise images, indicating that generalization in deep learning cannot be explained by traditional capacity or regularization arguments.
Quantifying Explanation Consistency: The C-Score Metric for CAM-Based Explainability in Medical Image Classification cs.CV · 2026-04-09 · unverdicted · none · ref 42
The C-Score quantifies intra-class explanation consistency for CAM methods via confidence-weighted pairwise soft IoU and detects AUC-consistency dissociation as an early warning for model instability on chest X-ray classification.
SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis cs.CV · 2025-12-17 · unverdicted · none · ref 74
SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.
Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation cs.LG · 2026-05-30 · unverdicted · none · ref 40
Matching in semantic SSL feature space via Sinkhorn divergence enables effective one-step generation on ImageNet by inducing compact geometry for distribution matching, with training and evaluation features best kept distinct.
Flow Matching with Arbitrary Auxiliary Paths cs.LG · 2026-05-07 · unverdicted · none · ref 42
AuxPath-FM extends flow matching to arbitrary auxiliary distributions while preserving the continuity equation and marginal training objective.
P-Guide: Parameter-Efficient Prior Steering for Single-Pass CFG Inference cs.AI · 2026-05-07 · unverdicted · none · ref 44
P-Guide achieves single-pass classifier-free guidance in flow matching by modulating the initial latent state and is equivalent to standard CFG under a first-order approximation while cutting latency by half.
Explicit Dropout: Deterministic Regularization for Transformer Architectures cs.LG · 2026-04-22 · unverdicted · none · ref 30
Explicit dropout reformulates stochastic dropout as deterministic loss penalties for Transformers, matching or exceeding standard performance with independent control per component.
Rotary Masked Autoencoders are Versatile Learners cs.LG · 2025-05-26 · unverdicted · none · ref 59
RoMAE applies rotary positional embeddings to masked autoencoders to enable representation learning and interpolation on continuous positional data across irregular time-series, images, and audio without modality-specific modifications.
GAP3D: Generative Alignment of VLM Latents to Patch-Level Embeddings for 3D Generation cs.CV · 2026-05-27 · unverdicted · none · ref 6
GAP3D is a diffusion-based alignment technique that maps VLM latents to dense patch embeddings from image encoders, enabling modular VLM conditioning for 3D generation without 3D training data.
HeartBeatAI: An Interpretable and Robust Deep Learning Framework for Multi-Label ECG Arrhythmia Detection cs.AI · 2026-05-23 · unverdicted · none · ref 30
HeartBeatAI reports 98% Macro F1 under intra-source testing on four ECG datasets but shows significant degradation on rare anomalies under leave-one-domain-out evaluation.
CNNs for Vis-NIR Chemometrics: From Contradiction to Conditional Design cs.LG · 2026-05-04 · unverdicted · none · ref 44
Contradictions across CNN studies for Vis-NIR chemometrics are expected outcomes of uncontrolled variables in spectral physics and validation design, motivating a conditional rather than universal design framework.
A Compact and Efficient 1.251 Million Parameter Machine Learning CNN Model PD36-C for Plant Disease Detection: A Case Study cs.CV · 2026-04-13 · unverdicted · none · ref 61
PD36-C is a 1.25 million parameter CNN achieving 99.53% average test accuracy on 38 plant disease classes from the New Plant Diseases Dataset, with a Qt-based app enabling edge deployment.
AlphaQuanter: An End-to-End Tool-Augmented Agentic Reinforcement Learning Framework for Stock Trading cs.CE · 2025-10-16 · unverdicted · none · ref 18
AlphaQuanter introduces a single-agent tool-augmented RL framework for stock trading that learns dynamic policies over a transparent decision workflow and reports state-of-the-art financial metrics.
Explaining Machine Learning and Memorization with Statistical Mechanics cs.LG · 2026-06-30 · unverdicted · none · ref 209
Thesis uses statistical mechanics to study DAM and RBM models for understanding memorization, low-dimensional learning, and adversarial robustness in neural networks.
Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification cs.CV · 2026-05-02 · unverdicted · none · ref 126
A DenseNet201 base model trained on a constructed plant leaf disease dataset outperforms baselines and enables faster, more robust transfer learning with less data than general models.
CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection cs.CV · 2026-05-25 · unverdicted · none · ref 25
Benchmark of twelve models finds hybrid CNN-transformer architectures and a SigLIP vision-language model deliver the strongest overall performance on skin cancer detection using the PAD-UFES-20 dataset.
Semi supervised GAN for smart microscopy, fast and data efficient cell cycle classification q-bio.QM · 2026-04-22 · unreviewed · ref 16

Rethinking the Inception Architecture for Computer Vision

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer