pith. sign in

arxiv: 2606.03218 · v1 · pith:W3FZODD4new · submitted 2026-06-02 · 💻 cs.CR

The Role of Domain-Specific Features in Malware Detection: A macOS Case Study

Pith reviewed 2026-06-28 09:46 UTC · model grok-4.3

classification 💻 cs.CR
keywords macOS malware detectiondomain-specific featuresMach-O binariesmachine learning classificationstatic analysismalware generalizationbinary classification
0
0 comments X

The pith

Domain-specific macOS features enable a malware detector to reach 98.5 percent accuracy and maintain performance on new samples.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that static features drawn from macOS binaries, such as embedded certificates, entitlements, persistence techniques, and selected system APIs, substantially raise the accuracy of machine learning malware detectors. These traits are extracted from a collection of more than 41,000 macOS executables and produce a detector that reaches 98.50 percent detection while beating prior methods by 16 percent on average. On a separate set of 9,000 fresh samples gathered later, the same model hits 99.50 percent detection, outperforming existing approaches by 50 percent, and removing the macOS-specific features causes a 15.92 percent drop. The results indicate that platform-unique binary details matter for both high detection rates and the ability to handle malware that appears after training.

Core claim

The paper establishes that static domain-specific features extracted from macOS Mach-O binaries, including embedded certificates, entitlements, persistence techniques and key system APIs, enable a machine learning detector to achieve 98.50 percent detection performance on a dataset of 41,129 samples and 99.50 percent on a temporal holdout of 9,000 fresh executables, outperforming prior approaches by 16 percent and 50 percent respectively, with the domain-specific features proving essential as their removal causes a 15.92 percent drop in detection on novel samples.

What carries the argument

The set of static domain-specific features for macOS binaries, such as embedded certificates, entitlements, persistence techniques, and key system APIs, which capture operating-system-unique traits for classifying unknown samples.

If this is right

  • Incorporating macOS-specific static traits produces higher detection rates than generic binary features alone.
  • The same features support stronger generalization when the model encounters malware samples collected after the training period.
  • Prior detectors that omit these platform traits will continue to show lower performance on macOS executables.
  • Releasing the labeled dataset allows other researchers to test and extend models that use these features.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Comparable platform-native feature sets may improve detection accuracy on other operating systems whose binary formats have received less study.
  • Tracking which macOS-specific attributes contribute most to decisions could help design lightweight rules that target common persistence methods.
  • If malware authors begin altering or stripping the highlighted domain-specific attributes, periodic retraining on updated feature definitions would be needed to preserve performance.

Load-bearing premise

The 41,129-sample training set and the separate 9,000-sample temporal test set are drawn from distributions that remain representative of real-world macOS executables over time and are not biased by collection method or labeling errors.

What would settle it

An independent collection of recent macOS binaries on which the detector without the domain-specific features matches or exceeds the detection rate of the full feature set.

Figures

Figures reproduced from arXiv: 2606.03218 by Andrea Oliveri, Biagio Montaruli, Davide Balzarotti, Savino Dambra.

Figure 1
Figure 1. Figure 1: Mach-O file format. structure of segments, allowing the operating system to handle and protect different types of data appropriately. Comparison with Windows Executables. The Mach-O file for￾mat differs from the Windows PE format [36] in several aspects. Mach-O binaries contain a single Mach header, while PE files begin with an MS-DOS stub, followed by the PE header, which includes a COFF (Common Object Fi… view at source ↗
Figure 2
Figure 2. Figure 2: Top-20 entitlements. [sec], [dev] and [gen] pre￾fixes represent the entitlements’ category: security-related, developer-related and generic one, respectively. As for the potentially packed goodware samples, we found that most of them (139) are x86-64 binaries, while 50 are arm64 and 104 are fat binaries. Packing is not very prevalent among macOS malware executa￾bles, with around 10% of the samples in our d… view at source ↗
Figure 3
Figure 3. Figure 3: Analysis of the most common macOS APIs. Persistence is not widely used in the dataset, with only 3% of samples employing such techniques. The presence of login items depends on both the sample’s nature and its architecture, while the presence of launch items is more indicative of malicious behavior, being more common in malware samples regardless of the architecture. 4.3.5 Analysis of the certificate statu… view at source ↗
Figure 4
Figure 4. Figure 4: Feature importance based on total gain for our detector. The macOS-specific features are highlighted in bold. [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: ROC curves of the machine learning models trained on the features proposed in this work, namely Decision Tree ( [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: ROC curves of the XGBoost model trained on the state-of-the-art features sets, namely ours ( [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
read the original abstract

Despite the growing popularity of macOS among end users and enterprise systems, malware research has primarily focused on Windows and Android operating systems, leaving the problem of macOS malware detection relatively unexplored. Indeed, the specificity of the operating system and the unique characteristics of the Mach-O file format can play a fundamental role in the classification of unknown samples, drastically increasing the detection rate. In this work, for the first time in the literature, we employ new domain-specific features, i.e., static features specific to macOS binaries, such as embedded certificates, entitlements, persistence techniques and key system APIs, to train a machine learning malware detector. We perform a comprehensive experimental evaluation on a novel dataset of 41,129 samples, comprising 11,413 benign and 29,716 malicious executables, and demonstrate that our solution achieves state-of-the-art detection performance (98.50%), outperforming all existing approaches, with an average improvement of 16% in terms of detection rate. We also provide an in-depth analysis of the importance of the individual features, showing that our detector effectively leverages the new domain-specific features. Then, in order to evaluate the generalization capabilities of our detector over time, we perform a real-world evaluation on a new dataset of 9,000 fresh macOS executables. The results show that (i) our detector maintains a very high detection rate (99.50%), (ii) outperforms the state-of-the-art by 50%, and (iii) the domain-specific features are crucial for generalizing to novel malware samples, as their removal leads to a 15.92% drop in detection performance. Finally, we also release our dataset to the research community.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces domain-specific static features for macOS malware detection based on the Mach-O format (embedded certificates, entitlements, persistence techniques, and key system APIs). It reports training ML models on a new dataset of 41,129 samples (11,413 benign, 29,716 malicious) to achieve 98.50% detection performance, outperforming prior work by an average of 16%. A temporal hold-out evaluation on 9,000 fresh samples yields 99.50% detection, with an ablation study indicating that removing the domain-specific features causes a 15.92% drop; the dataset is released publicly.

Significance. If the performance numbers and ablation results hold under reliable labeling, the work advances the under-explored macOS malware detection literature by demonstrating the value of OS-specific features and providing the first large public macOS malware dataset with temporal validation. Releasing the dataset is a clear strength that supports reproducibility and follow-on research.

major comments (2)
  1. [Experimental Evaluation] Dataset Collection and Labeling (Experimental Evaluation): The samples are stated to have been collected from public repositories and labeled via AV engines, yet no quantitative audit is provided (e.g., number of scanners, agreement threshold for positive labels, inter-scanner consistency, or manual review of a held-out subset). Because the headline claims (98.50% accuracy, 99.50% temporal detection, and the 15.92% ablation delta) rest directly on label correctness, the absence of such validation leaves open the possibility that label noise inflates both absolute performance and the reported importance of the domain-specific features.
  2. [Methods / Feature Engineering] Feature Extraction Procedure (Methods / Feature Engineering): The extraction logic for the new domain-specific features (e.g., how entitlements are parsed or persistence techniques identified) is described at a high level without pseudocode, implementation details, or reference to the released dataset schema. This prevents independent verification that the ablation study isolates the contribution of these features rather than implementation artifacts, directly affecting the central claim that the features are “crucial for generalizing to novel malware samples.”
minor comments (1)
  1. [Abstract] The abstract states that the approach “outperforms all existing approaches” without naming the baselines or citing them; moving the comparison details into the abstract or adding a short table reference would improve clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight important aspects of reproducibility and validation. We address each major comment below and commit to revisions that strengthen the manuscript without altering its core claims.

read point-by-point responses
  1. Referee: The samples are stated to have been collected from public repositories and labeled via AV engines, yet no quantitative audit is provided (e.g., number of scanners, agreement threshold for positive labels, inter-scanner consistency, or manual review of a held-out subset). Because the headline claims (98.50% accuracy, 99.50% temporal detection, and the 15.92% ablation delta) rest directly on label correctness, the absence of such validation leaves open the possibility that label noise inflates both absolute performance and the reported importance of the domain-specific features.

    Authors: We agree that explicit documentation of the labeling process is essential given the reliance on AV-based labels. The original manuscript described collection from public repositories and AV labeling at a high level. In revision we will add a dedicated subsection detailing the labeling procedure, including the specific AV engines consulted, the minimum agreement threshold applied for malicious labels, inter-scanner consistency metrics where available, and an explicit discussion of the limitations of AV labeling. This will allow readers to assess potential label noise directly. revision: yes

  2. Referee: The extraction logic for the new domain-specific features (e.g., how entitlements are parsed or persistence techniques identified) is described at a high level without pseudocode, implementation details, or reference to the released dataset schema. This prevents independent verification that the ablation study isolates the contribution of these features rather than implementation artifacts, directly affecting the central claim that the features are “crucial for generalizing to novel malware samples.”

    Authors: We concur that greater implementation transparency is required to support independent verification of the ablation results. In the revised manuscript we will include pseudocode for the core extraction routines (certificate parsing, entitlement extraction, persistence technique identification, and API usage), provide additional implementation notes, and explicitly reference the schema and documentation of the publicly released dataset so that the ablation study can be reproduced from the released artifacts. revision: yes

Circularity Check

0 steps flagged

No circularity: standard empirical ML pipeline on independent datasets

full rationale

The paper describes collection of a 41k-sample training set and a separate 9k temporal test set of macOS binaries, extraction of static features (including new domain-specific ones), training of an ML classifier, and reporting of accuracy/F1 on the held-out sets plus an ablation study. No equations, predictions, or uniqueness claims are present that reduce reported performance metrics to a parameter fitted from the same data by construction. The temporal test set is described as fresh and independent. Feature-importance analysis is post-hoc and does not feed back into the performance numbers. This is a conventional supervised-learning evaluation whose central claims rest on external data rather than self-referential definitions or self-citation chains.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that the chosen macOS-specific signals carry malware signal orthogonal to generic features and that the collected samples reflect the distribution of real threats; no free parameters or invented entities are described in the abstract.

axioms (1)
  • domain assumption The selected domain-specific features (certificates, entitlements, persistence techniques, key system APIs) are stable indicators that distinguish malicious from benign macOS binaries.
    Invoked when the authors state that these features 'play a fundamental role in the classification' and that their removal drops performance.

pith-pipeline@v0.9.1-grok · 5849 in / 1379 out tokens · 23811 ms · 2026-06-28T09:46:31.711426+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

66 extracted references · 13 canonical work pages

  1. [1]

    Anderson and Phil Roth

    Hyrum S. Anderson and Phil Roth. 2018. EMBER: An Open Dataset for Train- ing Static PE Malware Machine Learning Models.ArXiv e-prints(April 2018). arXiv:1804.04637

  2. [2]

    Apple. 2021. App code signing process in macOS. https://support.apple.com/en- gb/guide/security/sec3ad8e6e53/web Accessed: June 3, 2026

  3. [3]

    Apple. 2025. App Sandbox. https://developer.apple.com/documentation/securi ty/app-sandbox Accessed: June 3, 2026

  4. [4]

    Apple. 2025. Apple Documentation. https://developer.apple.com/documentation/ Accessed: June 3, 2026

  5. [5]

    Apple. 2025. Apple Platform Security. https://support.apple.com/en-us/102149 Accessed: June 3, 2026

  6. [6]

    Apple. 2025. Bundle Programming Guide. https://developer.apple.com/library/ archive/documentation/CoreFoundation/Conceptual/CFBundles/BundleTypes/ BundleTypes.html#//apple_ref/doc/uid/10000123i-CH101-SW1 Accessed: June 3, 2026

  7. [7]

    Apple. 2025. Entitlements. https://developer.apple.com/documentation/bundle resources/entitlements Accessed: June 3, 2026

  8. [8]

    Apple. 2025. Gatekeeper and runtime protection in macOS. https://support.ap ple.com/en-gb/guide/security/sec5599b66df/web Accessed: June 3, 2026

  9. [9]

    Apple. 2025. Hardened Runtime. https://developer.apple.com/documentation/se curity/hardened-runtime Accessed: June 3, 2026

  10. [10]

    Daniel Arp, Michael Spreitzenbarth, Malte Hubner, Hugo Gascon, and Konrad Rieck. 2014. Drebin: Effective and explainable detection of android malware in your pocket.. InNDSS, Vol. 14. The Internet Society, 23–26

  11. [11]

    Christopher M. Bishop. 2006.Pattern Recognition and Machine Learning (Infor- mation Science and Statistics). Springer-Verlag, Berlin, Heidelberg

  12. [12]

    Henrik Boström. 2022. crepes: a Python Package for Generating Conformal Regressors and Predictive Systems. InProceedings of the Eleventh Symposium on Conformal and Probabilistic Prediction and Applications (Proceedings of Machine Learning Research, Vol. 179), Ulf Johansson, Henrik Boström, Khuong An Nguyen, Zhiyuan Luo, and Lars Carlsson (Eds.). PMLR

  13. [13]

    Leo Breiman. 2001. Random forests.Machine learning45 (2001), 5–32

  14. [14]

    Jason Brownlee. 2024. XGBoost Best Feature Importance Score. https://xgboosti ng.com/xgboost-best-feature-importance-score/

  15. [15]

    Ero Carrera. 2025. Multi-platform Python module to parse and work with Portable Executable (PE) files. https://github.com/erocarrera/pefile Accessed: June 3, 2026

  16. [16]

    Nitesh V Chawla, Kevin W Bowyer, Lawrence O Hall, and W Philip Kegelmeyer

  17. [17]

    SMOTE: synthetic minority over-sampling technique.Journal of artificial intelligence research16 (2002), 321–357

  18. [18]

    2022.Machine Learning for OSX Malware Detection

    Alex Chenxingyu Chen and Kenneth Wulff. 2022.Machine Learning for OSX Malware Detection. Springer International Publishing, Cham, 209–222. https: //doi.org/10.1007/978-3-030-74753-4_14

  19. [19]

    Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A Scalable Tree Boosting System. InProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(San Francisco, California, USA)(KDD ’16). Association for Computing Machinery, New York, NY, USA, 785–794. https: //doi.org/10.1145/2939672.2939785

  20. [20]

    Emanuele Cozzi, Mariano Graziano, Yanick Fratantonio, and Davide Balzarotti

  21. [21]

    In2018 IEEE symposium on security and privacy (SP)

    Understanding linux malware. In2018 IEEE symposium on security and privacy (SP). IEEE, 161–175

  22. [22]

    Savino Dambra, Yufei Han, Simone Aonzo, Platon Kotzias, Antonino Vitale, Juan Caballero, Davide Balzarotti, and Leyla Bilge. 2023. Decoding the Secrets of Machine Learning in Malware Classification: A Deep Dive into Datasets, Feature Extraction, and Model Performance. InProceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security ...

  23. [23]

    Luca Demetrio, Battista Biggio, Giovanni Lagorio, Fabio Roli, and Alessandro Armando. 2021. Functionality-Preserving Black-Box Optimization of Adversarial Windows Malware.IEEE Transactions on Information Forensics and Security16 (2021), 3469–3478. doi:10.1109/TIFS.2021.3082330

  24. [24]

    Jonny Evans. 2023. Three-quarters of large US firms now using more Apple devices – survey. https://www.computerworld.com/article/1634358/three- quarters-of-large-us-firms-now-using-more-apple-devices-survey.html

  25. [25]

    Tom Fakterman. 2023. Through the Cortex XDR Lens: macOS Pirrit Adware. https://www.paloaltonetworks.com/blog/security-operations/through-the- cortex-xdr-lens-macos-pirrit-adware/ Accessed: June 3, 2026

  26. [26]

    In: Proc

    Samira Eisaloo Gharghasheh and Shahrzad Hadayeghparast. 2022.Mac OS X Malware Detection with Supervised Machine Learning Algorithms. Springer International Publishing, Cham, 193–208. https://doi.org/10.1007/978-3-030- 74753-4_13

  27. [27]

    2025.Machine Learning for Windows Malware Detection and Classification: Methods, Challenges, and Ongoing Research

    Daniel Gibert. 2025.Machine Learning for Windows Malware Detection and Classification: Methods, Challenges, and Ongoing Research. Springer Nature Switzerland, 143–173. doi:10.1007/978-3-031-66245-4_6

  28. [28]

    Leo Grinsztajn, Edouard Oyallon, and Gael Varoquaux. 2022. Why do tree-based models still outperform deep learning on typical tabular data?. InAdvances in Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 507–520

  29. [29]

    Joyce, Edward Raff, Charles Nicholas, and James Holt

    Robert J. Joyce, Edward Raff, Charles Nicholas, and James Holt. 2023. MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers. arXiv:2310.11706 [cs.CR]

  30. [30]

    Kaspersky Team. 2023. Are Macs safe? Threats to macOS users. https://www. kaspersky.com/blog/macos-users-cyberthreats-2023/50018/

  31. [31]

    Doowon Kim, Bum Jun Kwon, and Tudor Dumitraş. 2017. Certified Malware: Measuring Breaches of Trust in the Windows Code-Signing PKI. InProceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (CCS ’17). Association for Computing Machinery, New York, NY, USA, 1435–1448. doi:10.1145/3133956.3133958

  32. [32]

    Bojan Kolosnjaji, Apostolis Zarras, George Webster, and Claudia Eckert. 2016. Deep Learning for Classification of Malware System Call Sequences. InAI 2016: Advances in Artificial Intelligence, Byeong Ho Kang and Quan Bai (Eds.). Springer International Publishing, 137–149

  33. [33]

    Matous Kozak, Luca Demetrio, Dmitrijs Trizna, and Fabio Roli. 2024. Updating Windows Malware Detectors: Balancing Robustness and Regression against Adversarial EXEmples. https://arxiv.org/abs/2405.02646

  34. [34]

    Serhii Londar. 2025. Awesome macOS open source applications. https://github .com/serhii-londar/open-source-mac-os-apps Accessed: July 30, 2024

  35. [35]

    MalwareBazaar Team. 2025. MalwareBazaar. https://bazaar.abuse.ch Accessed: July 30, 2024

  36. [36]

    Modhuparna Manna, Andrew Case, Aisha Ali-Gombe, and Golden G. Richard

  37. [37]

    doi:10.1016/j.fsidi.2021.301221

    Modern macOS userland runtime analysis.Forensic Science International: Digital Investigation38 (2021), 301221. doi:10.1016/j.fsidi.2021.301221

  38. [38]

    Howell Max. 2025. The Missing Package Manager for macOS (or Linux). https: //brew.sh/ Accessed: July 30, 2024

  39. [39]

    Microsoft. 2025. PE Format. https://learn.microsoft.com/en-us/windows/win32 /debug/pe-format Accessed: June 3, 2026

  40. [40]

    MITRE. 2025. Bundlore. https://attack.mitre.org/versions/v15/software/S0482/ Accessed: June 3, 2026

  41. [41]

    2022.Interpretable Machine Learning(2 ed.)

    Christoph Molnar. 2022.Interpretable Machine Learning(2 ed.). Lulu.com. https://christophm.github.io/interpretable-ml-book

  42. [42]

    Biagio Montaruli, Andrea Oliveri, Savino Dambra, and Davide Balzarotti. 2025. The Role of Domain-Specific Features in Malware Detection: A macOS Case Study – Dataset. https://github.com/eurecom-s3/macos-malware-dataset

  43. [43]

    Moonlock Lab Team. 2024. Moonlock’s 2024 macOS threat report. https: //moonlock.com/moonlock-2024-macos-threat-report

  44. [44]

    Objective-See Foundation. 2025. macOS Malware Collection. https://github.c om/objective-see/Malware Accessed: July 30, 2024

  45. [45]

    Hamed Haddad Pajouh, Ali Dehghantanha, Raouf Khayami, and Kim-Kwang Ray- mond Choo. 2018. Intelligent OS X malware threat detection with code inspection. Journal of Computer Virology and Hacking Techniques14, 3 (2018), 213–223

  46. [46]

    Pedregosa, G

    F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cour- napeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine Learning in Python.Journal of Machine Learning Research12 (2011), 2825–2830

  47. [47]

    Andrea Ponte, Dmitrijs Trizna, Luca Demetrio, Battista Biggio, Ivan Tesfai Ogbu, and Fabio Roli. 2024. SLIFER: Investigating Performance and Robustness of Malware Detection Pipelines. https://arxiv.org/abs/2405.14478

  48. [48]

    Anderson, Bobby Filar, and Mark McLean

    Edward Raff, William Fleshman, Richard Zak, Hyrum S. Anderson, Bobby Filar, and Mark McLean. 2021. Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection.Proceedings of the AAAI Conference on Artificial Intelligence35, 11 (May 2021), 9386–9394. doi:10.1609/aaai.v35i11.17131

  49. [49]

    Sebastian Raschka. 2018. Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning. http://arxiv.org/abs/1811.12808

  50. [50]

    Raffaele Sabato, Phil Stokes, and Tom Hegel. 2025. BlueNoroff Hidden Risk | Threat Actor Targets Macs with Fake Crypto News and Novel Persistence. https://www.sentinelone.com/labs/bluenoroff-hidden-risk-threat-actor- targets-macs-with-fake-crypto-news-and-novel-persistence/ Accessed: June 3, 2026

  51. [51]

    2022.Evaluation of Supervised and Unsupervised Machine Learning Classifiers for Mac OS Malware Detection

    Dilip Sahoo and Yash Dhawan. 2022.Evaluation of Supervised and Unsupervised Machine Learning Classifiers for Mac OS Malware Detection. Springer International Publishing, Cham, 159–175. https://doi.org/10.1007/978-3-030-74753-4_11

  52. [52]

    Aidan Steele. 2025. OS X ABI Mach-O File Format Reference. https://github.c om/aidansteele/osx-abi-macho-file-format-reference Accessed: June 3, 2026

  53. [53]

    Phil Stokes. 2021. Massive New AdLoad Campaign Goes Entirely Undetected By Apple’s XProtect. https://www.sentinelone.com/labs/massive-new-adload- campaign-goes-entirely-undetected-by-apples-xprotect/ Accessed: June 3, 2026

  54. [54]

    Phil Stokes. 2025. macOS Adload: Prolific Adware Pivots Just Days After Apple’s XProtect Clampdown. https://www.sentinelone.com/blog/macos-adload- prolific-adware-pivots-just-days-after-apples-xprotect-clampdown/ Accessed: June 3, 2026

  55. [55]

    Phil Stokes. 2025. macOS Cuckoo Stealer | Ensuring Detection and Defense as New Samples Rapidly Emerge. https://www.sentinelone.com/blog/macos-cuck Montaruli, et al. oo-stealer-ensuring-detection-and-defense-as-new-samples-rapidly-emerge/ Accessed: June 3, 2026

  56. [56]

    Andrew Thaeler, Yagmur Yigit, Leandros Maglaras, William J Buchanan, Nagh- meh Moradpoor, and Gordon Russell. 2023. Enhancing Mac OS Malware De- tection through Machine Learning and Mach-O File Analysis. In2023 IEEE 28th International Workshop on Computer Aided Modeling and Design of Communica- tion Links and Networks (CAMAD). 170–175. doi:10.1109/CAMAD59...

  57. [57]

    Romain Thomas. 2017. LIEF - Library to Instrument Executable Formats. https: //lief.quarkslab.com/

  58. [58]

    Dmitrijs Trizna, Luca Demetrio, Battista Biggio, and Fabio Roli. 2024. Nebula: Self-Attention for Dynamic Malware Analysis.IEEE Transactions on Information Forensics and Security19 (2024), 6155–6167. doi:10.1109/TIFS.2024.3409083

  59. [59]

    VirusSamples Team. 2025. MacOS Malware Samples - A Collection of MacOS Malware Binaries. https://github.com/MalwareSamples/Macos-Malware- Samples Accessed: July 30, 2024

  60. [60]

    VirusShare Team. 2025. VirusShare.com - Because Sharing is Caring. https: //virusshare.com Accessed: June 3, 2026

  61. [61]

    VirusTotal. 2025. VirusTotal - Free Online Virus, Malware and URL Scanner. https://www.virustotal.com/ Accessed: June 3, 2026

  62. [62]

    Elizabeth Walkup. 2014. Mac Malware Detection via Static File Structure Analysis. https://cs229.stanford.edu/proj2014/Elizabeth%20Walkup,%20MacMalware.pdf

  63. [63]

    Liyuan Wang, Xingxing Zhang, Hang Su, and Jun Zhu. 2024. A Comprehensive Survey of Continual Learning: Theory, Method and Application.IEEE Trans- actions on Pattern Analysis and Machine Intelligence46, 8 (2024), 5362–5383. doi:10.1109/TPAMI.2024.3367329

  64. [64]

    2022.The Art of Mac Malware: The Guide to Analyzing Malicious Software

    Patrick Wardle. 2022.The Art of Mac Malware: The Guide to Analyzing Malicious Software. No Starch Press. https://taomm.org/vol1/read.html

  65. [65]

    2025.The Art of Mac Malware, Volume 2: Detecting Malicious Software

    Patrick Wardle. 2025.The Art of Mac Malware, Volume 2: Detecting Malicious Software. No Starch Press. https://taomm.org/vol2/read.html

  66. [66]

    George D Webster, Bojan Kolosnjaji, Christian von Pentz, Julian Kirsch, Zachary D Hanif, Apostolis Zarras, and Claudia Eckert. 2017. Finding the needle: A study of the pe32 rich header and respective malware triage. InDetection of Intrusions and Malware, and Vulnerability Assessment: 14th International Con- ference, DIMV A 2017, Bonn, Germany, July 6-7, 2...