Operator Catalog
This catalog enumerates every pipeline operator exposed across the nirs4all backend and the nirs4all_ui component library after the October 2025 restructuring. The component manifest is generated from scripts/generate_component_library.py, which populates public/component-library.json by introspecting the codebase and the installed scikit-learn distribution.
Category Overview
Category |
Feather icon |
Subcategories |
Components |
Description |
|---|---|---|---|---|
Augmentation |
|
1 |
8 |
Spectral augmentation operators to improve robustness |
Spectral Preprocessing |
|
7 |
18 |
Baseline, scatter, smoothing, derivatives, and spectral transforms |
Feature Engineering |
|
7 |
45 |
scikit-learn TransformerMixin utilities and feature builders |
Dimension Reduction |
|
3 |
39 |
Operators that change feature dimensionality |
Classical Models |
|
16 |
99 |
scikit-learn estimators (regressors, classifiers, wrappers) |
Deep Learning |
|
1 |
29 |
TensorFlow models bundled with nirs4all |
Validation & Splitting |
|
1 |
17 |
Cross-validation and sampling strategies |
Target Processing |
|
1 |
2 |
Transformations applied to the target variable ( |
Prediction & Outputs |
|
1 |
3 |
Prediction helpers and probability calibration |
Pipeline Utilities |
|
3 |
11 |
Containers, generators, and visualization helpers |
Augmentation
Spectral Augmentation –
Rotate & Translate,Random X Operation,Spline Smoothing,Spline X Perturbations,Spline Y Perturbations,Spline X Simplification,Spline Curve Simplification,Identity Augmenter
Spectral Preprocessing
Baseline Correction –
Baseline Removal,DetrendScatter & Normalization –
MSC,Standard Normal Variate,Robust Normal VariateSmoothing –
Savitzky-Golay,Gaussian FilterDerivatives –
First Derivative,Second Derivative,Sample DerivativeSpectral Transforms –
Wavelet Transform,Haar Wavelet,Log TransformNIRS Scaling –
Normalize Rows,Simple ScaleResampling & Alignment –
Adaptive Resampler,Crop Transformer,Resample Transformer
Feature Engineering (scikit-learn TransformerMixin)
scikit-learn Scalers –
Binarizer,FunctionTransformer,KernelCenterer,MaxAbsScaler,MinMaxScaler,Normalizer,PolynomialFeatures,PowerTransformer,QuantileTransformer,RobustScaler,SplineTransformer,StandardScalerEncoding & Binning –
KBinsDiscretizer,LabelBinarizer,LabelEncoder,MultiLabelBinarizer,OneHotEncoder,OrdinalEncoder,TargetEncoderImputation –
KNNImputer,MissingIndicator,SimpleImputerDimensionality Reduction –
CCA,DictionaryLearning,FactorAnalysis,FastICA,IncrementalPCA,Isomap,KernelPCA,LatentDirichletAllocation,LocallyLinearEmbedding,MiniBatchDictionaryLearning,MiniBatchNMF,MiniBatchSparsePCA,NMF,PCA,PLSCanonical,PLSRegression,PLSSVD,SparseCoder,SparsePCA,TSNE,TruncatedSVDFeature Selection –
GenericUnivariateSelect,RFE,RFECV,SelectFdr,SelectFpr,SelectFromModel,SelectFwe,SelectKBest,SelectPercentile,SequentialFeatureSelector,VarianceThresholdKernel & Projection –
AdditiveChi2Sampler,GaussianRandomProjection,Nystroem,PolynomialCountSketch,RBFSampler,SkewedChi2Sampler,SparseRandomProjectionFeature Extraction –
DictVectorizer,FeatureHasher,HashingVectorizer,PatchExtractor,TfidfTransformerCluster & Neighbors –
Birch,BisectingKMeans,FeatureAgglomeration,KMeans,KNeighborsTransformer,MiniBatchKMeans,NeighborhoodComponentsAnalysis,RadiusNeighborsTransformerMeta Transformers –
ColumnTransformer,FeatureUnion,RandomTreesEmbedding,StackingClassifier,StackingRegressor,VotingClassifier,VotingRegressorMiscellaneous Transformers –
BernoulliRBM,IsotonicRegression,LinearDiscriminantAnalysis
Dimension Reduction
Dimensionality Reduction -
CCA,DictionaryLearning,FactorAnalysis,FastICA,IncrementalPCA,Isomap,KernelPCA,LatentDirichletAllocation,LocallyLinearEmbedding,MiniBatchDictionaryLearning,MiniBatchNMF,MiniBatchSparsePCA,NMF,PCA,PLSCanonical,PLSRegression,PLSSVD,SparseCoder,SparsePCA,TSNE,TruncatedSVDFeature Selection -
GenericUnivariateSelect,RFE,RFECV,SelectFdr,SelectFpr,SelectFromModel,SelectFwe,SelectKBest,SelectPercentile,SequentialFeatureSelector,VarianceThresholdKernel & Projection -
AdditiveChi2Sampler,GaussianRandomProjection,Nystroem,PolynomialCountSketch,RBFSampler,SkewedChi2Sampler,SparseRandomProjection
Note: Transformer lists are derived automatically via
sklearn.utils.all_estimators(type_filter="transformer"), ensuring parity with the installed scikit-learn version.
Classical Models (scikit-learn estimators)
Baseline Models –
DummyClassifier,DummyRegressorCross Decomposition –
CCA,PLSCanonical,PLSRegressionDecision Trees –
DecisionTreeClassifier,DecisionTreeRegressor,ExtraTreeClassifier,ExtraTreeRegressorDiscriminant Analysis –
LinearDiscriminantAnalysis,QuadraticDiscriminantAnalysisEnsemble Methods –
AdaBoostClassifier,AdaBoostRegressor,BaggingClassifier,BaggingRegressor,ExtraTreesClassifier,ExtraTreesRegressor,GradientBoostingClassifier,GradientBoostingRegressor,HistGradientBoostingClassifier,HistGradientBoostingRegressor,RandomForestClassifier,RandomForestRegressor,StackingClassifier,StackingRegressor,VotingClassifier,VotingRegressorGaussian Process –
GaussianProcessClassifier,GaussianProcessRegressorKernel Ridge & Friends –
KernelRidgeLinear Models –
ARDRegression,BayesianRidge,ElasticNet,ElasticNetCV,GammaRegressor,HuberRegressor,Lars,LarsCV,Lasso,LassoCV,LassoLars,LassoLarsCV,LassoLarsIC,LinearRegression,LogisticRegression,LogisticRegressionCV,MultiTaskElasticNet,MultiTaskElasticNetCV,MultiTaskLasso,MultiTaskLassoCV,OrthogonalMatchingPursuit,OrthogonalMatchingPursuitCV,PassiveAggressiveClassifier,PassiveAggressiveRegressor,Perceptron,PoissonRegressor,QuantileRegressor,RANSACRegressor,Ridge,RidgeCV,RidgeClassifier,RidgeClassifierCV,SGDClassifier,SGDRegressor,TheilSenRegressor,TweedieRegressorMeta Estimators –
ClassifierChain,MultiOutputClassifier,MultiOutputRegressor,OneVsOneClassifier,OneVsRestClassifier,OutputCodeClassifier,RegressorChain,TransformedTargetRegressorNaive Bayes –
BernoulliNB,CategoricalNB,ComplementNB,GaussianNB,MultinomialNBNearest Neighbors –
KNeighborsClassifier,KNeighborsRegressor,NearestCentroid,RadiusNeighborsClassifier,RadiusNeighborsRegressorNeural Networks (sklearn) –
MLPClassifier,MLPRegressorProbabilistic & Calibration –
CalibratedClassifierCV,FixedThresholdClassifier,IsotonicRegression,TunedThresholdClassifierCVSemi-supervised –
LabelPropagation,LabelSpreading,SelfTrainingClassifierSupport Vector Machines –
LinearSVC,LinearSVR,NuSVC,NuSVR,SVC,SVRMiscellaneous Models – (currently empty; all estimators are classified above)
Note: Model listings are produced via
sklearn.utils.all_estimatorsfor both classifiers and regressors, preserving compatibility with the runtime environment.
Deep Learning (TensorFlow)
CONV_LSTM, Custom_Residuals, Custom_VG_Residuals, Custom_VG_Residuals2, FFT_Conv, MLP, ResNetV2_model, SEResNet_model, UNET, UNet_NIRS, VGG_1D, XCeption1D, bard, customizable_decon, customizable_nicon, customizable_nicon_classification, decon, decon_classification, decon_layer_classification, inception1D, nicon, nicon_VG, nicon_VG_classification, nicon_classification, senseen_origin, transformer, transformer_VG, transformer_VG_classification, transformer_classification
These factories are tagged via the framework("tensorflow") decorator and are surfaced under the TensorFlowModelController.
Validation & Splitting
Splitting Strategies -
Shuffle Split,K-Fold,Stratified K-Fold,Repeated K-Fold,Repeated Stratified K-Fold,Group K-Fold,Group Shuffle Split,Stratified Shuffle Split,Time Series Split,Leave-One-Out,Leave-P-Out,Kennard-Stone Splitter,SPXY Splitter,KMeans Splitter,SPlit Splitter,Systematic Circular,KBins Stratified
Target Processing
Target Transforms –
Integer KBins Discretizer,Range Discretizer
Prediction & Outputs
Prediction Utilities –
Batch Prediction,Real-time Prediction,Probability Calibration
Pipeline Utilities
Containers -
Feature Augmentation,Sample Augmentation,Sequential,Pipeline,Y Processing(augmentation/processing containers accept only preprocessing, augmentation, or feature-engineering nodes;Y Processingis limited to target transforms;Pipelineremains unrestricted)Generators –
_OR_,_RANGE_(parameter sweep and branching)Visualization –
2D Chart,Y Distribution Chart,Fold Chart
Maintenance Notes
Run
python scripts/generate_component_library.pyfromnirs4all_uito regenerate the library after adding new operators. The script introspects nirs4all modules and callssklearn.utils.all_estimatorsto keep the catalog aligned with the installed version.UI components consume
public/component-library.json; the pipeline editor reads this file vialibraryDataLoader.Container rules use category tokens (e.g.
category:preprocessing) so augmentation and preprocessing containers reject models or incompatible nodes whilePipelinestays unrestricted.If new TensorFlow models are added under
nirs4all.operators.models, ensure they carry theframework("tensorflow")decorator so they are picked up automatically.