AICurious Logo

What is: LightAutoML?

SourceLightAutoML: AutoML Solution for a Large Financial Services Ecosystem
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

LightAutoML is an AutoML solution targeted for financial services companies. A typical LightAutoML pipeline scheme is presented in the Figure, each pipeline containing:

  • Reader: object that receives raw data and task as input, calculates some useful metadata, performs initial data cleaning and decides about data manipulations that should be done before fitting different model types.

  • LightAutoML inner datasets that contains metadata and CV iterators that implements validation scheme for the datasets.

  • Multiple ML Pipelines that are stacked and/or blended to get a single prediction.

An ML pipeline in LightAutoML is one or multiple ML models that share a single data preprocessing and validation scheme. The preprocessing step may have up to two feature selection steps, a feature engineering step or even just be empty if no preprocessing is needed. The ML pipelines can be computed independently on the same datasets and then blended together using averaging (or weighted averaging). Alternatively, a stacking ensemble scheme can be used to build multi level ensemble architectures.