Data Augmentation๏
This section covers data augmentation techniques for NIRS data.
Overview๏
Data augmentation is a powerful technique to improve model robustness, especially when working with limited training data. NIRS4ALL supports both sample-level and feature-level augmentation.
Overview of augmentation methods and strategies.
Complete guide to augmenting training data with synthetic samples, including balanced mode for imbalanced datasets.
Generate synthetic NIRS spectra for testing and validation.
Types of Augmentation๏
Sample Augmentation๏
Create new training samples by applying transformations to existing spectra:
Noise injection
Baseline shifts
Wavelength perturbations
Mixup strategies
Feature Augmentation๏
Generate multiple views of the same data through different preprocessing:
Multiple preprocessing variants
Different spectral regions
Derivative combinations
Quick Example๏
from nirs4all.operators.augmentation import SampleAugmenter
pipeline = [
{"sample_augmentation": SampleAugmenter(
noise_level=0.01,
n_augmented=3
)},
# ... rest of pipeline
]
See Also๏
Preprocessing - Preprocessing techniques
Operator Catalog - All available operators