AI Glossary: What Is Stacking? Definition & Meaning

Stacking im maschinellen Lernen

Stacking, oder gestapelte Generalisierung, ist eine Ensemble-Lerntechnik im maschinellen Lernen to improve the accuracy of predictions by combining the strengths of multiple models. The core idea behind stacking is to build a new model that learns how to best combine the predictions from several base models, also known as level-0 models.

Der Prozess umfasst typischerweise zwei Hauptphasen:

Training der Basis-Modelle: In the first stage, various base models (like decision trees, neural networks, or Support-Vektor-Maschinen) are trained on the same dataset. Each model may capture different patterns and aspects of the data, which contributes to the diversity necessary for effective ensemble learning.
Training des Meta-Modells: In the second stage, a new model, called the meta-model or level-1 model, is trained using the predictions made by the base models as input features. This meta-model learns to weigh the predictions from each base model to produce a final prediction.

Stacking kann zu erheblichen Verbesserungen in Modellleistung, as it reduces the likelihood of overfitting by leveraging multiple learning algorithms. Common techniques used in stacking include cross-validation to ensure that the base models are trained on different subsets of the data, thereby enhancing the robustness of the meta-model.

Stacking is a powerful approach in various applications, including classification, regression, and even complex domains like der Verarbeitung natürlicher Sprache and image recognition. While it may require more computational resources than single-model approaches, the potential gain in predictive performance often justifies the added complexity.