AI Glossary: What Is Out-of-Sample Evaluation? Definition & Meaning

Out-of-Sample（アウト・オブ・サンプル）評価 refers to the process of assessing the performance of an 人工知能 (AI) model using data that was not included during the model’s training phase. This evaluation is crucial for understanding how well the model can generalize its learned patterns to new, unseen data, which is a key indicator of its effectiveness in real-world applications.

AIと機械学習, models are trained on a specific dataset, known as the training set. However, if we only evaluate the model on this training set, we may obtain an overly optimistic view of its performance. This is because the model may simply memorize the 訓練データ instead of learning to generalize. To combat this issue, out-of-sample evaluation is performed using a separate dataset, often called the test set or validation set, which contains data that the model has not encountered before.

アウト・オブ・サンプル評価を行う一般的な手法には次のようなものがあります：

ホールドアウト法： Splitting the entire dataset into a training set and a test set. The model is trained on the training set and evaluated on the test set.
K-分割交差検証： Dividing the dataset into ‘k’ subsets. The model is trained ‘k’ times, each time using a different subset as the test set, while the remaining subsets are used for training. This method provides a more robust evaluation.
Leave-One-Out交差検証： A special case of k-fold cross-validation where ‘k’ is equal to the number of instances in the dataset. Each instance is used once as a test set while the remaining instances form the training set.

全体として、アウト・オブ・サンプル評価はモデル開発 lifecycle, ensuring that the AI system is reliable and effective in practical scenarios.