SEED: Semi-supervised Continual MalwarE Detection for Tackling ConcEpt Drift on a BuDget

Bheemarjuna Reddy Tamma; Bikraj Shresta; Siva Ram murthy Chebiyam; Sumohana S Channappayya; Suresh Kumar Amalapuram

arxiv: 2605.24903 · v2 · pith:OSROILBUnew · submitted 2026-05-24 · 💻 cs.CR · cs.LG

SEED: Semi-supervised Continual MalwarE Detection for Tackling ConcEpt Drift on a BuDget

Suresh Kumar Amalapuram , Bikraj Shresta , Siva Ram murthy Chebiyam , Bheemarjuna Reddy Tamma , Sumohana S Channappayya This is my paper

Pith reviewed 2026-06-30 00:15 UTC · model grok-4.3

classification 💻 cs.CR cs.LG

keywords malware detectionconcept driftsemi-supervised learningcontinual learningactive learningsingular value decompositionrepresentation consistencydelayed buffer update

0 comments

The pith

SEED detects unseen malware using only 20 percent labeled data on prior tasks by projecting samples into an SVD-derived representation space and selecting uncertain ones via cosine distance.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SEED to keep malware detectors effective as applications change over time, even when most training data remains unlabeled. It replaces hierarchical contrastive loss with a binary cross-entropy objective that pairs projected unlabeled samples from seen tasks with labeled ones to enforce representation consistency. For fully unlabeled new tasks, the same space supplies uncertainty scores that guide active learning. A reader cares because label collection in security is costly, so methods that maintain performance with far less supervision could extend detector lifetime without constant analyst effort.

Core claim

SEED is a semantic-structure-agnostic approach that combines a tailored binary cross-entropy loss with semi-supervised continual learning and active learning; unlabeled samples from seen tasks are projected via singular value decomposition into a space built from prior data to encourage consistency, while cosine distance in that space quantifies uncertainty to select samples for labeling on unseen tasks, yielding 40 percent AUT gains on BODMAS and 14 percent on AndroZoo with 20 percent labeled data relative to the semi-supervised baseline.

What carries the argument

The SVD-constructed representation space from previously seen data, which projects unlabeled samples for consistency on seen tasks and supplies cosine-distance uncertainty scores for active learning on unseen tasks.

If this is right

SEED remains competitive with the baseline on APIGraph while improving on BODMAS and AndroZoo under 20 percent labeling.
A delayed buffer update reduces label noise propagation during replay and improves learning stability across tasks.
The same pipeline applies to both Windows and Android malware datasets under partial and full unlabeled settings.
Active learning on unseen tasks uses only the representation space built from seen tasks, avoiding the need for semantic structure assumptions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The projection-and-distance mechanism might transfer to other continual-learning settings that face concept drift with scarce labels, such as network intrusion detection.
Lowering the labeled fraction below 20 percent on seen tasks would test how far the SVD consistency signal can be stretched before performance collapses.
Combining the delayed buffer update with other replay strategies could further stabilize training when label noise is high.
The method's reliance on a single fixed representation space raises the question of how often that space must be rebuilt as the number of seen tasks grows.

Load-bearing premise

Unlabeled samples can be projected into an SVD-built space from earlier tasks to produce reliable consistency and uncertainty signals.

What would settle it

An experiment that replaces the SVD-derived space with random vectors or replaces cosine-distance selection with random selection and measures whether the reported AUT improvements on BODMAS and AndroZoo disappear.

Figures

Figures reproduced from arXiv: 2605.24903 by Bheemarjuna Reddy Tamma, Bikraj Shresta, Siva Ram murthy Chebiyam, Sumohana S Channappayya, Suresh Kumar Amalapuram.

**Figure 1.** Figure 1: Comparison of SEED with HCL∗ (HCL trained on partially labeled data) for unseen malware detection on the BODMAS and AndroZoo datasets. The BODMAS dataset spans twelve months; SEED and HCL∗ are trained on the first five months and evaluated on the remaining seven months. Similarly, AndroZoo spans three years (36 months); the malware detectors are trained on the first twelve months and evaluated on the remai… view at source ↗

**Figure 2.** Figure 2: Graphical illustration of the end-to-end training process of the proposed SSCL method for seen tasks. effectively capture the most salient features of the encoded data, reducing storage costs associated with storing all the encoded representations (Eq 2) particularly for longer sequence of tasks. The basis vectors of the latent representation are derived using singular value decomposition (SVD). SVD decomp… view at source ↗

**Figure 3.** Figure 3: Illustration of buffer memory sample storage and retrieval policy. At the beginning, memory is [PITH_FULL_IMAGE:figures/full_fig_p014_3.png] view at source ↗

**Figure 4.** Figure 4: Comparing the proposed method (Ours/SEED with labeling budget of 100 samples per month) with the CADE and HCL∗ using the PR-AUC metric on the detection capability of unseen tasks malware. Each experiment is repeated for three different seed values and mean PR-AUC values are used for demonstration. Our method consistently outperforms CADE and HCL∗ on BODMAS and AndroZoo on PR-AUC metric, stays competitive o… view at source ↗

**Figure 5.** Figure 5: Comparing the proposed method (Ours/SEED with labeling budget 100) with the CADE and HCL∗ using the PR-AUC metric on the detection capability of unseen tasks benign samples. Each experiment is repeated for three different seed values and mean PR-AUC values are used for demonstration. Our method stays competitive/outperforms CADE and HCL∗ on BODMAS, AndroZoo and APIGraph on PR-AUC metric, except on couple o… view at source ↗

**Figure 6.** Figure 6: Comparing the proposed method (Ours with labeling budget 100) with the CADE and HCL [PITH_FULL_IMAGE:figures/full_fig_p020_6.png] view at source ↗

**Figure 7.** Figure 7: Comparing the proposed method (Ours/SEED with labeling budget 100) with the CADE and HCL∗ using the PR-AUC metric on the detection capability of seen tasks benign samples. Each experiment is repeated for three different seed values and mean PR-AUC values are used for demonstration. Our method consistently outperforms CADE and HCL∗ on BODMAS and APIGraph on PR-AUC metric, stays competitive on AndroZoo datas… view at source ↗

**Figure 8.** Figure 8: Comparing the effect of the label noise with varying label ratio and mitigating its effect using the SEED method on BODMAS, AndroZoo and APIGraph 6.7 Effect of label noise on detection performance In this section, we study how increasing levels of label noise (via label flipping) affect the detection performance on unseen malware. Specifically, we vary the label-noise ratio from 10% to 80% and analyze how … view at source ↗

**Figure 9.** Figure 9: Operating-point comparison of SEED with baseline detectors on the AndroZoo dataset at fixed false positive rate (FPR) thresholds. The left column presents CADE vs SEED, the right column shows HCL vs SEED, and the bottom panel shows HCL∗ vs SEED. Within each comparison group, the three plots correspond to FPR levels 10−1 , 10−2 , and 10−3 . Across all plots, the red curves denote SEED, whereas the blue curv… view at source ↗

**Figure 10.** Figure 10: Operating-point comparison of SEED with baseline detectors on the BODMAS dataset at fixed false positive rate (FPR) thresholds. The left column presents CADE vs SEED, the right column shows HCL vs SEED, and the bottom panel shows HCL∗ vs SEED. Within each comparison group, the three plots correspond to FPR levels 10−1 , 10−2 , and 10−3 . Across all plots, the red curves denote SEED, whereas the blue curve… view at source ↗

**Figure 11.** Figure 11: Operating-point comparison of SEED with baseline detectors on the APIGraph dataset at fixed false positive rate (FPR) thresholds. The top, middle, and bottom rows show CADE vs SEED, HCL vs SEED, and HCL∗ vs SEED, respectively. Each column corresponds to an FPR constraint of 10−1 , 10−2 , 10−3 , and 10−4 . Across all plots, red curves denote SEED and blue curves denote the corresponding baseline method. , … view at source ↗

**Figure 12.** Figure 12: Comparing the detection performance (Unseen-AUT(A)) for self labeling and analysts labeling with varying labeled data ratio. A pronounced gap is observed on AndroZoo, BODMAS remains comparatively close, and APIGraph shows stable analyst performance while selflabeling degrades at higher ratios. In this subsection, we compare two labeling strategies for samples from unseen tasks: self-labeling, where th… view at source ↗

**Figure 13.** Figure 13: Comparing the detection performance (Unseen-AUT(A)) under centroid distance and allsamples distance strategies with varying labeled data ratios on BODMAS, AndroZoo, and APIGraph datasets. APIGraph consistently achieves the highest performance, while BODMAS and AndroZoo show gradual improvements as the labeled data ratio increases. AndroZoo. Both strategies maintain consistently high performance across … view at source ↗

**Figure 14.** Figure 14: Comparing the detection performance (Unseen-AUT(A)) under varying SVD thresholds with different labeled data ratios on BODMAS, AndroZoo, and APIGraph datasets. Overall, the 95% SVD threshold consistently achieves competitive or superior performance across datasets, indicating that retaining a larger proportion of latent variance preserves discriminative information for unseen malware detection. AndroZoo… view at source ↗

read the original abstract

Machine learning based malware detectors become obsolete over time due to concept drift in benign and malware applications. Recent methods rely on fully labeled data and use hierarchical contrastive loss (HCL) with active learning to improve robustness against drift by exploiting semantic structure in malware representations. However, obtaining labeled data in the security domain is difficult. Under partially labeled settings, HCL suffers significant performance degradation in detecting unseen malware, especially on datasets such as BODMAS where strong semantic structure may not exist. In this paper, we propose SEED, a semantic-structure-agnostic method for malware detection under limited supervision. SEED combines a tailored binary cross-entropy objective with semi-supervised continual learning and active learning. For partially labeled seen tasks, unlabeled samples are projected into a representation space constructed from previously seen data using singular value decomposition, and paired with suitable labeled samples to encourage representation consistency. For unseen tasks with fully unlabeled data, uncertainty is quantified using cosine distance in representation space, and the most uncertain samples are selected for analyst labeling. We evaluate SEED on both Windows and Android malware datasets. Using only 20% labeled data on seen tasks, SEED achieves average AUT improvements of 40% on BODMAS and 14% on AndroZoo for unseen malware detection compared to HCL* (the semi-supervised adaptation of HCL), while remaining competitive on APIGraph. Finally, we introduce a delayed buffer update strategy to reduce label noise propagation during replay and improve learning stability.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SEED adapts HCL with SVD projection and cosine-distance sampling to handle 20% labels in continual malware detection, claiming solid gains on BODMAS and AndroZoo over the semi-supervised baseline.

read the letter

SEED targets the drop in performance that HCL shows when labels are scarce by dropping reliance on semantic structure. It uses a tailored binary cross-entropy loss, projects unlabeled samples from seen tasks into an SVD-built space from prior data to keep representations consistent, and switches to cosine distance in that space to pick uncertain samples for labeling on fully unlabeled new tasks. A delayed buffer update is added to limit noise during replay.

The concrete pipeline is new relative to the cited HCL work, and the direct comparison on BODMAS, AndroZoo, and APIGraph under the 20% label regime is the main empirical contribution. The approach is pragmatic for security settings where labels cost analyst time and drift is constant.

The abstract states 40% and 14% AUT lifts on two datasets and competitiveness on the third, but supplies no splits, error bars, or ablation on the SVD and cosine choices. Without those, it is difficult to tell whether the gains are stable or whether the representation space behaves as assumed. The claim that cosine distance reliably flags uncertainty for active learning on unseen tasks is plausible but untested in the summary.

This is for researchers who already work on continual or active learning for malware and want a label-efficient variant that does not require strong semantic clusters. A reader who needs to ship a detector under budget constraints could extract the method and test it.

The work deserves peer review because it engages a real operational constraint with a clear alternative to the prior baseline, even if the experiments will need tightening.

Referee Report

0 major / 2 minor

Summary. The paper proposes SEED, a semantic-structure-agnostic method for semi-supervised continual malware detection under concept drift and limited labels. It augments a binary cross-entropy objective with semi-supervised continual learning and active learning: unlabeled samples from partially labeled seen tasks are projected via SVD into a representation space derived from prior data to enforce consistency, while fully unlabeled unseen tasks use cosine-distance uncertainty sampling for active labeling. A delayed buffer update is added to mitigate label noise in replay. On BODMAS, AndroZoo, and APIGraph, SEED with 20% labeled data on seen tasks reports average AUT gains of 40% and 14% over the semi-supervised HCL* baseline for unseen malware detection (competitive on APIGraph).

Significance. If the empirical gains prove robust, the work is significant for security ML because it directly targets the high cost of labeling and the rapid obsolescence of detectors due to drift. The SVD-projection and cosine-distance mechanisms provide a lightweight way to leverage unlabeled data without requiring strong semantic structure, and the delayed-buffer heuristic addresses a practical stability issue in replay-based continual learning. These elements could inform future semi-supervised continual-learning pipelines in adversarial domains.

minor comments (2)

[Abstract] Abstract: performance numbers (40% and 14% AUT) are stated without reference to the number of runs, error bars, dataset splits, or the precise definition of AUT; the main text should make these details immediately locatable (e.g., via a dedicated experimental-setup subsection or table footnote).
The stylized title capitalization (SEED, MalwarE, ConcEpt, BuDget) is a minor distraction for citation and indexing; consider a conventional title or a parenthetical expansion on first use.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive summary, recognition of the work's significance in security ML, and recommendation for minor revision. No specific major comments were listed in the report.

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper proposes an empirical semi-supervised continual learning method (SEED) that combines binary cross-entropy loss with SVD-based projection for representation consistency on partially labeled seen tasks and cosine-distance uncertainty sampling for active learning on unseen tasks. Performance claims rest on direct empirical comparisons to an externally adapted baseline (HCL*) across independent datasets (BODMAS, AndroZoo, APIGraph) under 20% labeling. No equations, predictions, or uniqueness claims reduce to self-definition, fitted inputs renamed as outputs, or load-bearing self-citations. The construction is self-contained against external benchmarks with no internal reduction to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Central claim depends on two domain assumptions about representation consistency via SVD and uncertainty via cosine distance; no free parameters or invented entities are identifiable from the abstract.

axioms (2)

domain assumption Unlabeled samples can be projected into an SVD-derived representation space from seen data to encourage consistency with labeled samples.
Core mechanism stated for partially labeled seen tasks.
domain assumption Cosine distance in the representation space quantifies sample uncertainty for selecting labels on unseen tasks.
Used to drive active learning on fully unlabeled unseen data.

pith-pipeline@v0.9.1-grok · 5836 in / 1499 out tokens · 53953 ms · 2026-06-30T00:15:45.338219+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

53 extracted references · 2 canonical work pages · 2 internal anchors

[1]

Imagenet classification with deep convolutional neural networks

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. “Imagenet classification with deep convolutional neural networks”. In:in Proceedings of the Advances in Neural Information Processing Systems25 (2012)

2012
[2]

Attention is All you Need

Ashish Vaswani et al. “Attention is All you Need”. In:Proceedings of the Advances in Neural Information Processing Systems. Vol. 30. 2017

2017
[3]

https://media.kaspersky.com/en/enterprise- security/KasperskyLab-Whitepaper-Machine-Learning.pdf

Kaspersky Lab Whitepaper Machine Learning. https://media.kaspersky.com/en/enterprise- security/KasperskyLab-Whitepaper-Machine-Learning.pdf. [4]Themida - Advanced Windows Software Protection. https://www.oreans.com/Themida.php
[4]

https://www.oreans.com/ CodeVirtualizer.php

CodeVirtualizer - Obfuscation System Against Reverse Engineering. https://www.oreans.com/ CodeVirtualizer.php
[5]

Drift Forensics of Malware Classifiers

Theo Chow et al. “Drift Forensics of Malware Classifiers”. In:Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security (AISec). 2023, pp. 197–207. , Vol. 1, No. 1, Article . Publication date: May 2026. 40 Suresh Kumar Amalapuram, Bikraj Shresta, Siva Ram Murthy Chebiyam, Bheemarjuna Reddy Tamma, and Sumohana Channappayya

2023
[6]

Robust semi-supervised learning when not all classes have labels

Lan-Zhe Guo et al. “Robust semi-supervised learning when not all classes have labels”. In:in Proceedings of the Advances in Neural Information Processing Systems35 (2022), pp. 3305–3317

2022
[7]

Open-World Semi-Supervised Learning

Kaidi Cao, Maria Brbic, and Jure Leskovec. “Open-World Semi-Supervised Learning”. In: Proceedings of the International Conference on Learning Representations. 2022.url: https: //openreview.net/forum?id=O-r8LOR-CCA

2022
[8]

A comprehensive survey of continual learning: Theory, method and application

Liyuan Wang et al. “A comprehensive survey of continual learning: Theory, method and application”. In:IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2024), pp. 1–20

2024
[9]

Gradient episodic memory for continual learn- ing

David Lopez-Paz and Marc’Aurelio Ranzato. “Gradient episodic memory for continual learn- ing”. In:Proceedings of 31st Conference on Advances in Neural Information Processing Systems (NeurIPS). Vol. 30. 2017, pp. 6470–6479

2017
[10]

Efficient Lifelong Learning with A-GEM

Arslan Chaudhry et al. “Efficient lifelong learning with a-gem”. In:arXiv preprint arXiv:1812.00420 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[11]

Gradient based sample selection for online continual learning

Rahaf Aljundi et al. “Gradient based sample selection for online continual learning”. In: Proceeding of 33rd Conference on Neural Information Processing Systems (NeurIPS). Vol. 32. 2019

2019
[12]

Online continual learning from imbalanced data

Aristotelis Chrysakis and Marie-Francine Moens. “Online continual learning from imbalanced data”. In:Proceeding of 37th International Conference on Machine Learning (ICML). 2020, pp. 1952–1961

2020
[13]

Continuous Learning for Android Malware Detection

Yizheng Chen, Zhoujie Ding, and David Wagner. “Continuous Learning for Android Malware Detection”. In:Proceedings of the 32nd USENIX Security Symposium (USENIX Security). Aug. 2023, pp. 1127–1144

2023
[14]

BODMAS: An open dataset for learning based temporal analysis ofPE malware

Limin Yang et al. “BODMAS: An open dataset for learning based temporal analysis ofPE malware”. In:2021 IEEE Security and Privacy Workshops (SPW). 2021, pp. 78–84

2021
[15]

Drebin: Effective and explainable detection of android malware in your pocket

Daniel Arp et al. “Drebin: Effective and explainable detection of android malware in your pocket.” In:Proceedings of the Network and Distributed Systems Security (NDSS) Symposium. Vol. 14. 2014, pp. 23–26

2014
[16]

TESSERACT: Eliminating experimental bias in malware classifica- tion across space and time

Feargus Pendlebury et al. “TESSERACT: Eliminating experimental bias in malware classifica- tion across space and time”. In:Proceedings of the 28th USENIX Security Symposium (USENIX Security). 2019, pp. 729–746

2019
[17]

CADE: Detecting and explaining concept drift samples for security appli- cations

Limin Yang et al. “CADE: Detecting and explaining concept drift samples for security appli- cations”. In:30th USENIX Security Symposium (USENIX Security 21). 2021, pp. 2327–2344

2021
[18]

From grim reality to practical solution: Malware classification in real-world noise

Xian Wu et al. “From grim reality to practical solution: Malware classification in real-world noise”. In:Proceedings of the IEEE Symposium on Security and Privacy (SP). 2023, pp. 2602– 2619

2023
[19]

On the limitations of contin- ual learning for malware classification

Mohammad Saidur Rahman, Scott Coull, and Matthew Wright. “On the limitations of contin- ual learning for malware classification”. In:Proceedings of the First Conference on Lifelong Learning Agents (CoLLAs). 2022, pp. 564–582

2022
[20]

Reviewer integration and performance measurement for malware detec- tion

Brad Miller et al. “Reviewer integration and performance measurement for malware detec- tion”. In:Proceedings of the Detection of Intrusions and Malware, and Vulnerability Assessment (DIMV A). 2016, pp. 122–141

2016
[21]

Error prevalence in nids datasets: A case study on cic-ids-2017 and cse-cic- ids-2018

Lisa Liu et al. “Error prevalence in nids datasets: A case study on cic-ids-2017 and cse-cic- ids-2018”. In:Proceedings of IEEE Conference on Communications and Network Security (CNS). 2022, pp. 254–262

2017
[22]

Springer Nature, 2022

Xiaojin Zhu and Andrew B Goldberg.Introduction to semi-supervised learning. Springer Nature, 2022

2022
[23]

Transcend: Detecting concept drift in malware classification models

Roberto Jordaney et al. “Transcend: Detecting concept drift in malware classification models”. In:26th USENIX security symposium (USENIX security 17). 2017, pp. 625–642. , Vol. 1, No. 1, Article . Publication date: May 2026. SEED: Semi-supervised Continual MalwarE Detection for Tackling ConcEpt Drift on a BuDget 41

2017
[24]

Anomaly Detection in the Open World: Normality Shift Detection, Explanation, and Adaptation

Dongqi Han et al. “Anomaly Detection in the Open World: Normality Shift Detection, Explanation, and Adaptation”. In:Proceeding of Network and Distributed System Security Symposium (NDSS). 2023

2023
[25]

Transcending transcend: Revisiting malware classification in the presence of concept drift

Federico Barbero et al. “Transcending transcend: Revisiting malware classification in the presence of concept drift”. In:2022 IEEE Symposium on Security and Privacy (SP). 2022, pp. 805–823

2022
[26]

ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection

Shuo Yang et al. “ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection”. In:Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, pp. 3818–3828

2024
[27]

Detecting and Mitigating Sampling Bias in Cybersecu- rity with Unlabeled Data

Saravanan Thirumuruganathan et al. “Detecting and Mitigating Sampling Bias in Cybersecu- rity with Unlabeled Data”. In:33rd USENIX Security Symposium (USENIX Security 24). 2024, pp. 1741–1758

2024
[28]

Insomnia: Towards concept-drift robustness in network intrusion detection

Giuseppina Andresini et al. “Insomnia: Towards concept-drift robustness in network intrusion detection”. In:Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security. 2021, pp. 111–122

2021
[29]

Droidevolver: Self-evolving android malware detection system

Ke Xu et al. “Droidevolver: Self-evolving android malware detection system”. In:2019 IEEE European Symposium on Security and Privacy (EuroS&P). IEEE. 2019, pp. 47–62

2019
[30]

Investigating labelless drift adaptation for malware detection

Zeliang Kan et al. “Investigating labelless drift adaptation for malware detection”. In:Pro- ceedings of the 14th ACM Workshop on Artificial Intelligence and Security. 2021, pp. 123– 134

2021
[31]

Application of Anomaly Detection Models to Malware Detection in the Presence of Concept Drift

David Escudero García and Noemí DeCastro-García. “Application of Anomaly Detection Models to Malware Detection in the Presence of Concept Drift”. In:International Conference on Hybrid Artificial Intelligence Systems. Springer. 2023, pp. 15–26

2023
[32]

Vime: Extending the success of self-and semi-supervised learning to tabular domain

Jinsung Yoon et al. “Vime: Extending the success of self-and semi-supervised learning to tabular domain”. In:Advances in Neural Information Processing Systems33 (2020), pp. 11033– 11043

2020
[33]

When Adversarial Perturbations meet Concept Drift: an Exploratory Analysis on ML-NIDS

Giovanni Apruzzese, Aurore Fass, and Fabio Pierazzi. “When Adversarial Perturbations meet Concept Drift: an Exploratory Analysis on ML-NIDS”. In:The 17th ACM Workshop on Artificial Intelligence Security (AISec). 2024

2024
[34]

Pseudo-Labeling for Class Incremental Learning

Alexis Lechat, Stéphane Herbin, and Frédéric Jurie. “Pseudo-Labeling for Class Incremental Learning”. In:Proceedings of the 32nd British Machine Vision Conference (BMVC). 2021

2021
[35]

Ordisco: Effective and efficient usage of incremental unlabeled data for semi-supervised continual learning

Liyuan Wang et al. “Ordisco: Effective and efficient usage of incremental unlabeled data for semi-supervised continual learning”. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2021, pp. 5383–5392

2021
[36]

SPIDER: A Semi-Supervised Continual Learning-based Network Intrusion Detection System

Suresh K. Amalapuram, Bheemarjuna Tamma, and Sumohana S. Channappayya. “SPIDER: A Semi-Supervised Continual Learning-based Network Intrusion Detection System”. In: Proceedings of the 43rd IEEE Conference on Computer Communications (IEEE INFOCOM). 2024, pp. 571–580

2024
[37]

Learning Multiple Layers of Features from Tiny Images

A. Krizhevsky. “Learning Multiple Layers of Features from Tiny Images”. In:Master’s Thesis, University of Toronto(2009).url: https://www.cs.utoronto.ca/~kriz/learning-features-2009- TR.pdf

2009
[38]

Imbalanced malware family classification using multimodal fusion and weight self-learning

Shudong Li et al. “Imbalanced malware family classification using multimodal fusion and weight self-learning”. In:IEEE Transactions on Intelligent Transportation Systems24.7 (2022), pp. 7642–7652

2022
[39]

Enhancing state-of-the-art classifiers with api semantics to detect evolved android malware

Xiaohan Zhang et al. “Enhancing state-of-the-art classifiers with api semantics to detect evolved android malware”. In:Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security. 2020, pp. 757–770. [41]AndroZoo. https://androzoo.uni.lu/.. , Vol. 1, No. 1, Article . Publication date: May 2026. 42 Suresh Kumar Amalapuram, Bikraj S...

2020
[40]

EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models

H. S. Anderson and P. Roth. “EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models”. In:ArXiv e-prints(Apr. 2018). arXiv: 1804.04637[cs.CR]

work page internal anchor Pith review Pith/arXiv arXiv 2018
[41]

Aug- mented Memory Replay-based Continual Learning Approaches for Network Intrusion Detec- tion

Suresh Kumar Amalapuram, Sumohana S. Channappayya, and Bheemarjuna Tamma. “Aug- mented Memory Replay-based Continual Learning Approaches for Network Intrusion Detec- tion”. In:Proceedings of 37th Conference on Neural Information Processing Systems (NeurIPS). Vol. 36. 2023

2023
[42]

A theoretical analysis of the learning dynamics under class imbalance

Emanuele Francazi, Marco Baity-Jesi, and Aurelien Lucchi. “A theoretical analysis of the learning dynamics under class imbalance”. In:International Conference on Machine Learning. PMLR. 2023, pp. 10285–10322

2023
[43]

Gradient reweighting: Towards imbalanced class-incremental learning

Jiangpeng He. “Gradient reweighting: Towards imbalanced class-incremental learning”. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024, pp. 16668–16677

2024
[44]

Orthogonal gradient descent for continual learning

Mehrdad Farajtabar et al. “Orthogonal gradient descent for continual learning”. In:Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS). 2020, pp. 3762–3773

2020
[45]

Gradient Projection Memory for Continual Learning

Gobinda Saha, Isha Garg, and Kaushik Roy. “Gradient Projection Memory for Continual Learning”. In:Proceedings of the International Conference on Learning Representations (ICLR). 2020

2020
[46]

Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling

Zhen Zhao et al. “Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling”. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023, pp. 3718–3727

2023
[47]

Re-measuring the label dynamics of online anti-malware engines from millions of samples

Jingjing Wang et al. “Re-measuring the label dynamics of online anti-malware engines from millions of samples”. In:Proceedings of the 2023 ACM on Internet Measurement Conference. 2023, pp. 253–267

2023
[48]

Measuring and Modeling the Label Dynamics of Online Anti-Malware Engines

Shuofei Zhu et al. “Measuring and Modeling the Label Dynamics of Online Anti-Malware Engines”. In:29th USENIX Security Symposium (USENIX Security 20). USENIX Association, Aug. 2020, pp. 2361–2378.isbn: 978-1-939133-17-5.url: https://www.usenix.org/conference/ usenixsecurity20/presentation/zhu

2020
[49]

Towards more realistic evaluations: The impact of label delays in malware detection pipelines

Marcus Botacin and Heitor Gomes. “Towards more realistic evaluations: The impact of label delays in malware detection pipelines”. In:Computers & Security148 (2025), p. 104122

2025
[50]

Overcoming catastrophic forgetting in neural networks

James Kirkpatrick et al. “Overcoming catastrophic forgetting in neural networks”. In:National Academy of Sciences114.13 (2017), pp. 3521–3526

2017
[51]

Online continual learning with maximal interfered retrieval

Rahaf Aljundi et al. “Online continual learning with maximal interfered retrieval”. In:Pro- ceeding of 33rd Conference on Neural Information Processing Systems (NeurIPS). Vol. 32. 2019

2019
[52]

Dos and don’ts of machine learning in computer security

Daniel Arp et al. “Dos and don’ts of machine learning in computer security”. In:Proceeding of 31st USENIX Security Symposium (USENIX Security). 2022

2022
[53]

AnoShift: A distribution shift benchmark for unsupervised anomaly de- tection

Marius Dragoi et al. “AnoShift: A distribution shift benchmark for unsupervised anomaly de- tection”. In:Proceeding of 36th Conference on Neural Information Processing Systems (NeurIPS). Vol. 35. 2022, pp. 32854–32867. , Vol. 1, No. 1, Article . Publication date: May 2026

2022

[1] [1]

Imagenet classification with deep convolutional neural networks

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. “Imagenet classification with deep convolutional neural networks”. In:in Proceedings of the Advances in Neural Information Processing Systems25 (2012)

2012

[2] [2]

Attention is All you Need

Ashish Vaswani et al. “Attention is All you Need”. In:Proceedings of the Advances in Neural Information Processing Systems. Vol. 30. 2017

2017

[3] [3]

https://media.kaspersky.com/en/enterprise- security/KasperskyLab-Whitepaper-Machine-Learning.pdf

Kaspersky Lab Whitepaper Machine Learning. https://media.kaspersky.com/en/enterprise- security/KasperskyLab-Whitepaper-Machine-Learning.pdf. [4]Themida - Advanced Windows Software Protection. https://www.oreans.com/Themida.php

[4] [4]

https://www.oreans.com/ CodeVirtualizer.php

CodeVirtualizer - Obfuscation System Against Reverse Engineering. https://www.oreans.com/ CodeVirtualizer.php

[5] [5]

Drift Forensics of Malware Classifiers

Theo Chow et al. “Drift Forensics of Malware Classifiers”. In:Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security (AISec). 2023, pp. 197–207. , Vol. 1, No. 1, Article . Publication date: May 2026. 40 Suresh Kumar Amalapuram, Bikraj Shresta, Siva Ram Murthy Chebiyam, Bheemarjuna Reddy Tamma, and Sumohana Channappayya

2023

[6] [6]

Robust semi-supervised learning when not all classes have labels

Lan-Zhe Guo et al. “Robust semi-supervised learning when not all classes have labels”. In:in Proceedings of the Advances in Neural Information Processing Systems35 (2022), pp. 3305–3317

2022

[7] [7]

Open-World Semi-Supervised Learning

Kaidi Cao, Maria Brbic, and Jure Leskovec. “Open-World Semi-Supervised Learning”. In: Proceedings of the International Conference on Learning Representations. 2022.url: https: //openreview.net/forum?id=O-r8LOR-CCA

2022

[8] [8]

A comprehensive survey of continual learning: Theory, method and application

Liyuan Wang et al. “A comprehensive survey of continual learning: Theory, method and application”. In:IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2024), pp. 1–20

2024

[9] [9]

Gradient episodic memory for continual learn- ing

David Lopez-Paz and Marc’Aurelio Ranzato. “Gradient episodic memory for continual learn- ing”. In:Proceedings of 31st Conference on Advances in Neural Information Processing Systems (NeurIPS). Vol. 30. 2017, pp. 6470–6479

2017

[10] [10]

Efficient Lifelong Learning with A-GEM

Arslan Chaudhry et al. “Efficient lifelong learning with a-gem”. In:arXiv preprint arXiv:1812.00420 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[11] [11]

Gradient based sample selection for online continual learning

Rahaf Aljundi et al. “Gradient based sample selection for online continual learning”. In: Proceeding of 33rd Conference on Neural Information Processing Systems (NeurIPS). Vol. 32. 2019

2019

[12] [12]

Online continual learning from imbalanced data

Aristotelis Chrysakis and Marie-Francine Moens. “Online continual learning from imbalanced data”. In:Proceeding of 37th International Conference on Machine Learning (ICML). 2020, pp. 1952–1961

2020

[13] [13]

Continuous Learning for Android Malware Detection

Yizheng Chen, Zhoujie Ding, and David Wagner. “Continuous Learning for Android Malware Detection”. In:Proceedings of the 32nd USENIX Security Symposium (USENIX Security). Aug. 2023, pp. 1127–1144

2023

[14] [14]

BODMAS: An open dataset for learning based temporal analysis ofPE malware

Limin Yang et al. “BODMAS: An open dataset for learning based temporal analysis ofPE malware”. In:2021 IEEE Security and Privacy Workshops (SPW). 2021, pp. 78–84

2021

[15] [15]

Drebin: Effective and explainable detection of android malware in your pocket

Daniel Arp et al. “Drebin: Effective and explainable detection of android malware in your pocket.” In:Proceedings of the Network and Distributed Systems Security (NDSS) Symposium. Vol. 14. 2014, pp. 23–26

2014

[16] [16]

TESSERACT: Eliminating experimental bias in malware classifica- tion across space and time

Feargus Pendlebury et al. “TESSERACT: Eliminating experimental bias in malware classifica- tion across space and time”. In:Proceedings of the 28th USENIX Security Symposium (USENIX Security). 2019, pp. 729–746

2019

[17] [17]

CADE: Detecting and explaining concept drift samples for security appli- cations

Limin Yang et al. “CADE: Detecting and explaining concept drift samples for security appli- cations”. In:30th USENIX Security Symposium (USENIX Security 21). 2021, pp. 2327–2344

2021

[18] [18]

From grim reality to practical solution: Malware classification in real-world noise

Xian Wu et al. “From grim reality to practical solution: Malware classification in real-world noise”. In:Proceedings of the IEEE Symposium on Security and Privacy (SP). 2023, pp. 2602– 2619

2023

[19] [19]

On the limitations of contin- ual learning for malware classification

Mohammad Saidur Rahman, Scott Coull, and Matthew Wright. “On the limitations of contin- ual learning for malware classification”. In:Proceedings of the First Conference on Lifelong Learning Agents (CoLLAs). 2022, pp. 564–582

2022

[20] [20]

Reviewer integration and performance measurement for malware detec- tion

Brad Miller et al. “Reviewer integration and performance measurement for malware detec- tion”. In:Proceedings of the Detection of Intrusions and Malware, and Vulnerability Assessment (DIMV A). 2016, pp. 122–141

2016

[21] [21]

Error prevalence in nids datasets: A case study on cic-ids-2017 and cse-cic- ids-2018

Lisa Liu et al. “Error prevalence in nids datasets: A case study on cic-ids-2017 and cse-cic- ids-2018”. In:Proceedings of IEEE Conference on Communications and Network Security (CNS). 2022, pp. 254–262

2017

[22] [22]

Springer Nature, 2022

Xiaojin Zhu and Andrew B Goldberg.Introduction to semi-supervised learning. Springer Nature, 2022

2022

[23] [23]

Transcend: Detecting concept drift in malware classification models

Roberto Jordaney et al. “Transcend: Detecting concept drift in malware classification models”. In:26th USENIX security symposium (USENIX security 17). 2017, pp. 625–642. , Vol. 1, No. 1, Article . Publication date: May 2026. SEED: Semi-supervised Continual MalwarE Detection for Tackling ConcEpt Drift on a BuDget 41

2017

[24] [24]

Anomaly Detection in the Open World: Normality Shift Detection, Explanation, and Adaptation

Dongqi Han et al. “Anomaly Detection in the Open World: Normality Shift Detection, Explanation, and Adaptation”. In:Proceeding of Network and Distributed System Security Symposium (NDSS). 2023

2023

[25] [25]

Transcending transcend: Revisiting malware classification in the presence of concept drift

Federico Barbero et al. “Transcending transcend: Revisiting malware classification in the presence of concept drift”. In:2022 IEEE Symposium on Security and Privacy (SP). 2022, pp. 805–823

2022

[26] [26]

ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection

Shuo Yang et al. “ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection”. In:Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, pp. 3818–3828

2024

[27] [27]

Detecting and Mitigating Sampling Bias in Cybersecu- rity with Unlabeled Data

Saravanan Thirumuruganathan et al. “Detecting and Mitigating Sampling Bias in Cybersecu- rity with Unlabeled Data”. In:33rd USENIX Security Symposium (USENIX Security 24). 2024, pp. 1741–1758

2024

[28] [28]

Insomnia: Towards concept-drift robustness in network intrusion detection

Giuseppina Andresini et al. “Insomnia: Towards concept-drift robustness in network intrusion detection”. In:Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security. 2021, pp. 111–122

2021

[29] [29]

Droidevolver: Self-evolving android malware detection system

Ke Xu et al. “Droidevolver: Self-evolving android malware detection system”. In:2019 IEEE European Symposium on Security and Privacy (EuroS&P). IEEE. 2019, pp. 47–62

2019

[30] [30]

Investigating labelless drift adaptation for malware detection

Zeliang Kan et al. “Investigating labelless drift adaptation for malware detection”. In:Pro- ceedings of the 14th ACM Workshop on Artificial Intelligence and Security. 2021, pp. 123– 134

2021

[31] [31]

Application of Anomaly Detection Models to Malware Detection in the Presence of Concept Drift

David Escudero García and Noemí DeCastro-García. “Application of Anomaly Detection Models to Malware Detection in the Presence of Concept Drift”. In:International Conference on Hybrid Artificial Intelligence Systems. Springer. 2023, pp. 15–26

2023

[32] [32]

Vime: Extending the success of self-and semi-supervised learning to tabular domain

Jinsung Yoon et al. “Vime: Extending the success of self-and semi-supervised learning to tabular domain”. In:Advances in Neural Information Processing Systems33 (2020), pp. 11033– 11043

2020

[33] [33]

When Adversarial Perturbations meet Concept Drift: an Exploratory Analysis on ML-NIDS

Giovanni Apruzzese, Aurore Fass, and Fabio Pierazzi. “When Adversarial Perturbations meet Concept Drift: an Exploratory Analysis on ML-NIDS”. In:The 17th ACM Workshop on Artificial Intelligence Security (AISec). 2024

2024

[34] [34]

Pseudo-Labeling for Class Incremental Learning

Alexis Lechat, Stéphane Herbin, and Frédéric Jurie. “Pseudo-Labeling for Class Incremental Learning”. In:Proceedings of the 32nd British Machine Vision Conference (BMVC). 2021

2021

[35] [35]

Ordisco: Effective and efficient usage of incremental unlabeled data for semi-supervised continual learning

Liyuan Wang et al. “Ordisco: Effective and efficient usage of incremental unlabeled data for semi-supervised continual learning”. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2021, pp. 5383–5392

2021

[36] [36]

SPIDER: A Semi-Supervised Continual Learning-based Network Intrusion Detection System

Suresh K. Amalapuram, Bheemarjuna Tamma, and Sumohana S. Channappayya. “SPIDER: A Semi-Supervised Continual Learning-based Network Intrusion Detection System”. In: Proceedings of the 43rd IEEE Conference on Computer Communications (IEEE INFOCOM). 2024, pp. 571–580

2024

[37] [37]

Learning Multiple Layers of Features from Tiny Images

A. Krizhevsky. “Learning Multiple Layers of Features from Tiny Images”. In:Master’s Thesis, University of Toronto(2009).url: https://www.cs.utoronto.ca/~kriz/learning-features-2009- TR.pdf

2009

[38] [38]

Imbalanced malware family classification using multimodal fusion and weight self-learning

Shudong Li et al. “Imbalanced malware family classification using multimodal fusion and weight self-learning”. In:IEEE Transactions on Intelligent Transportation Systems24.7 (2022), pp. 7642–7652

2022

[39] [39]

Enhancing state-of-the-art classifiers with api semantics to detect evolved android malware

Xiaohan Zhang et al. “Enhancing state-of-the-art classifiers with api semantics to detect evolved android malware”. In:Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security. 2020, pp. 757–770. [41]AndroZoo. https://androzoo.uni.lu/.. , Vol. 1, No. 1, Article . Publication date: May 2026. 42 Suresh Kumar Amalapuram, Bikraj S...

2020

[40] [40]

EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models

H. S. Anderson and P. Roth. “EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models”. In:ArXiv e-prints(Apr. 2018). arXiv: 1804.04637[cs.CR]

work page internal anchor Pith review Pith/arXiv arXiv 2018

[41] [41]

Aug- mented Memory Replay-based Continual Learning Approaches for Network Intrusion Detec- tion

Suresh Kumar Amalapuram, Sumohana S. Channappayya, and Bheemarjuna Tamma. “Aug- mented Memory Replay-based Continual Learning Approaches for Network Intrusion Detec- tion”. In:Proceedings of 37th Conference on Neural Information Processing Systems (NeurIPS). Vol. 36. 2023

2023

[42] [42]

A theoretical analysis of the learning dynamics under class imbalance

Emanuele Francazi, Marco Baity-Jesi, and Aurelien Lucchi. “A theoretical analysis of the learning dynamics under class imbalance”. In:International Conference on Machine Learning. PMLR. 2023, pp. 10285–10322

2023

[43] [43]

Gradient reweighting: Towards imbalanced class-incremental learning

Jiangpeng He. “Gradient reweighting: Towards imbalanced class-incremental learning”. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024, pp. 16668–16677

2024

[44] [44]

Orthogonal gradient descent for continual learning

Mehrdad Farajtabar et al. “Orthogonal gradient descent for continual learning”. In:Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS). 2020, pp. 3762–3773

2020

[45] [45]

Gradient Projection Memory for Continual Learning

Gobinda Saha, Isha Garg, and Kaushik Roy. “Gradient Projection Memory for Continual Learning”. In:Proceedings of the International Conference on Learning Representations (ICLR). 2020

2020

[46] [46]

Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling

Zhen Zhao et al. “Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling”. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023, pp. 3718–3727

2023

[47] [47]

Re-measuring the label dynamics of online anti-malware engines from millions of samples

Jingjing Wang et al. “Re-measuring the label dynamics of online anti-malware engines from millions of samples”. In:Proceedings of the 2023 ACM on Internet Measurement Conference. 2023, pp. 253–267

2023

[48] [48]

Measuring and Modeling the Label Dynamics of Online Anti-Malware Engines

Shuofei Zhu et al. “Measuring and Modeling the Label Dynamics of Online Anti-Malware Engines”. In:29th USENIX Security Symposium (USENIX Security 20). USENIX Association, Aug. 2020, pp. 2361–2378.isbn: 978-1-939133-17-5.url: https://www.usenix.org/conference/ usenixsecurity20/presentation/zhu

2020

[49] [49]

Towards more realistic evaluations: The impact of label delays in malware detection pipelines

Marcus Botacin and Heitor Gomes. “Towards more realistic evaluations: The impact of label delays in malware detection pipelines”. In:Computers & Security148 (2025), p. 104122

2025

[50] [50]

Overcoming catastrophic forgetting in neural networks

James Kirkpatrick et al. “Overcoming catastrophic forgetting in neural networks”. In:National Academy of Sciences114.13 (2017), pp. 3521–3526

2017

[51] [51]

Online continual learning with maximal interfered retrieval

Rahaf Aljundi et al. “Online continual learning with maximal interfered retrieval”. In:Pro- ceeding of 33rd Conference on Neural Information Processing Systems (NeurIPS). Vol. 32. 2019

2019

[52] [52]

Dos and don’ts of machine learning in computer security

Daniel Arp et al. “Dos and don’ts of machine learning in computer security”. In:Proceeding of 31st USENIX Security Symposium (USENIX Security). 2022

2022

[53] [53]

AnoShift: A distribution shift benchmark for unsupervised anomaly de- tection

Marius Dragoi et al. “AnoShift: A distribution shift benchmark for unsupervised anomaly de- tection”. In:Proceeding of 36th Conference on Neural Information Processing Systems (NeurIPS). Vol. 35. 2022, pp. 32854–32867. , Vol. 1, No. 1, Article . Publication date: May 2026

2022