
arxiv: 1809.06839 · v1 · pith:ICRBEZBNnew · submitted 2018-09-18 · 💻 cs.CV

Albumentations: fast and flexible image augmentations

classification 💻 cs.CV
keywords: image, albumentations, augmentation, augmentations, used, commonly, transformations, available

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve output labels. In the computer vision domain, image augmentations have become a common implicit regularization technique to combat overfitting in deep convolutional neural networks and are ubiquitously used to improve performance. While most deep learning frameworks implement basic image transformations, the list is typically limited to variations and combinations of flipping, rotating, scaling, and cropping. Moreover, image processing speed varies across existing augmentation tools. We present Albumentations, a fast and flexible library for image augmentations that offers a wide variety of image transform operations and also serves as an easy-to-use wrapper around other augmentation libraries. We provide examples of image augmentations for different computer vision tasks and show that Albumentations is faster than other commonly used image augmentation tools on most of the commonly used image transformations. The source code for Albumentations is publicly available online at https://github.com/albu/albumentations
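
To make the idea of label-preserving input transformations concrete, here is a minimal sketch in plain Python. It is not the Albumentations API or implementation: the helper names (`horizontal_flip`, `vertical_flip`, `rotate90`, `compose`) and the nested-list image representation are assumptions chosen for illustration only. Each transform changes the pixels but leaves the label untouched, and each is applied with its own probability.

```python
import random

# Illustrative helpers only; these names are hypothetical and are
# not taken from Albumentations or any other library.

def horizontal_flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]

def vertical_flip(image):
    """Reverse the order of the rows."""
    return [list(row) for row in image[::-1]]

def rotate90(image):
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*image[::-1])]

def compose(transforms, rng):
    """Build a pipeline from (transform, probability) pairs.

    Each transform is applied in order, independently, with its
    own probability.
    """
    def pipeline(image):
        for transform, p in transforms:
            if rng.random() < p:
                image = transform(image)
        return image
    return pipeline

# A tiny 2x3 "image" with one intensity value per pixel.
image = [[1, 2, 3],
         [4, 5, 6]]
label = "cat"  # the transforms never touch the label

rng = random.Random(0)  # seeded so runs are reproducible
augment = compose(
    [(horizontal_flip, 0.5), (vertical_flip, 0.5), (rotate90, 0.5)],
    rng,
)
augmented = augment(image)
```

Calling `augment` repeatedly on the same image produces different variants, each paired with the original label. The transforms here only rearrange pixels, so the set of pixel values stays the same.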

This paper has not been read by Pith yet.


Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. MARVEL: Margin-Aware Robust von Mises-Fischer Expert Learning for Long-Tailed Out-of-Distribution Detection

    cs.CV 2026-07 unverdicted novelty 5.0

    MARVEL introduces a multi-expert NvMF-based system with an outlier expert that reduces FPR95 in OOD detection on medical datasets by 8-37%.

  2. Validation of an AI-based end-to-end model for prostate pathology using long-term archived routine samples

    cs.CV 2026-05 unverdicted novelty 4.0

    GleasonAI achieves quadratic-weighted kappa of 0.86 on ISUP grading of 10,366 long-term archived prostate biopsy cores, with performance stable over 17 years and a clear prognostic gradient for cancer-specific mortality.