Albumentations
Albumentations is an open-source image augmentation library introduced in 2018 by researchers Alexander Buslaev, Vladimir Iglovikov, and Alex Parinov. The library provides a flexible and efficient framework for applying data augmentation techniques in computer vision workflows. Additional core contributors include Eugene Khvedchenya and Mikhail Druzhinin.
Albumentations is built on top of the OpenCV library and provides optimized implementations of many image-processing operations. Its API enables composition of augmentation pipelines for tasks including image classification, object detection, semantic segmentation, instance segmentation, keypoint detection, and multi-target workflows.
Adoption
Albumentations is widely used in academic research, industrial machine learning pipelines, and Kaggle competitions.The project is an Affiliated Project of NumFOCUS, a non-profit organization supporting open-source scientific computing.
The Albumentations research paper has received more than 3000 citations according to Google Scholar.
The library has been used in solutions to computer vision challenges, including the Kaggle Deepfake Detection Challenge.
Package analytics indicate more than 100 million cumulative installations.
Features
Albumentations supports deterministic and randomized augmentation pipelines for a broad range of annotation types.2D image and annotation targets
Supported 2D targets include:- Images in RGB, grayscale, and arbitrary multi-channel configurations
- Segmentation masks and dense label maps with any number of channels, including instance segmentation multi-mask workflows
- Bounding boxes in COCO, Pascal VOC, YOLO, and Albumentations formats
- Keypoints in multiple coordinate systems
- multiple images,
- multiple masks,
- multiple bounding-box sets,
- multiple keypoint sets.
A full compatibility matrix of transforms and supported targets is documented online.
Video and temporal data
Albumentations supports consistent augmentation of video sequences by applying transforms with shared random parameters across frames.Volumetric (3D) data
Albumentations supports 3D augmentation for medical imaging and scientific computing, including:- 3D images with arbitrary channel counts,
- 3D multi-class or multi-channel masks,
- 3D keypoints.
Fork and successor project
In 2024–2025, the original Albumentations project entered maintenance mode. A successor project, AlbumentationsX, was created to extend functionality and continue active development.AlbumentationsX expands support for 2D, 3D, and video targets, introduces new transformations, and provides internal performance improvements while staying API-compatible with Albumentations.
AlbumentationsX uses a dual licensing model: AGPL-3.0 or a commercial license.
A benchmarking repository comparing AlbumentationsX with other libraries is publicly available.
Example
The following program applies the same augmentation pipeline to an image and its segmentation mask:import albumentations as A
import cv2
transform = A.Compose,
A.HorizontalFlip,
A.RandomBrightnessContrast,
])
image = cv2.imread
image = cv2.cvtColor
mask = cv2.imread
transformed = transform
aug_image = transformed
aug_mask = transformed