AI Glossary: What Is Scaling Laws (SL)? Definition & Meaning

Les lois d’échelle dans le contexte de intelligence artificielle (AI) refer to the observable patterns that indicate how the performance of apprentissage automatique models improves as the size of the model, the amount of données d'entraînement, or both increase. These laws suggest that larger models trained on more data tend to achieve better performance, often following a predictable curve.

In recherche en IA, scaling laws have been particularly influential in understanding the capabilities of large réseaux neuronaux. For instance, as the number of parameters in a model increases, the model’s ability to generalize from training data to unseen data often improves. This relationship can be quantified mathematically, typically expressed in terms of power laws, where métriques de performance (comme la précision ou la perte) sont tracées en fonction de la taille du modèle ou du jeu de données.

Researchers have found that these scaling relationships can help predict how a model’s performance will change with varying sizes or amounts of data, allowing for more efficient resource allocation when developing AI systems. For example, if a model’s performance improves consistently with increased size, a team might decide to invest in more ressources informatiques pour faire évoluer leurs modèles afin d’obtenir de meilleurs résultats.

However, it’s important to note that scaling laws do not hold indefinitely; there are diminishing returns at very large scales where performance improvements may plateau despite increasing model sizes or data. Understanding these limits is crucial for AI practitioners to avoid wasted resources and to implement models that are both efficient and effective.