AI Glossary: What Is Hyperparameters? Definition & Meaning

Im Bereich der künstlichen Intelligenz verwendet wird and maschinellem Lernen, hyperparameters are crucial settings that dictate how a model learns from data. Unlike parameters, which are learned by the model during training (such as weights in a neural network), hyperparameters are set before the training process begins and remain constant throughout.

Hyperparameter können die Leistung eines Modells erheblich beeinflussen. Einige gängige Beispiele sind:

Lernrate: This determines the step size at each iteration while moving toward a minimum of a loss function. A learning rate that is too high can cause the model to converge too quickly to a suboptimal solution, while a learning rate that is too low can make the training process painfully slow.
Stapelgröße: This refers to the number of training examples utilized in one iteration. A larger batch size can lead to faster training but may also result in less accurate updates to the model weights.
Anzahl der Epochen: An epoch is one complete pass through the entire training dataset. Setting the right number of epochs is crucial; too few can lead to underfitting, while too many can lead to overfitting.
Regularisierungsparameter: These are used to prevent overfitting by adding a penalty for larger coefficients in the model. Common techniques include L1 and L2-Regularisierung.

Choosing the right hyperparameters often requires experimentation and can be guided by techniques such as grid search or random search, which systematically explore different combinations of hyperparameters. Advanced methods like Bayessche Optimierung kann auch für effizientere Suchvorgänge verwendet werden.

Zusammenfassend sind Hyperparameter grundlegend für das Training und die Leistung von Machine-Learning-Modellen, und ihre sorgfältige Abstimmung kann den Unterschied zwischen einem mittelmäßigen und einem äußerst effektiven Modell ausmachen.