AI Glossary: What Is Log Loss? Definition & Meaning

ログ損失, also known as logistic loss or クロスエントロピー損失, is a performance metric used primarily in 二値分類 problems where the predicted output is a probability value ranging between 0 and 1. It quantifies the difference between the predicted probabilities and the actual class labels (0 or 1). The objective of using log loss is to evaluate how well a classification model predicts probabilities for binary outcomes, with lower values indicating better モデルのパフォーマンス.

数学的には、ログ損失は次の式を用いて計算されます：

Log Loss = -1/N * Σ [y * log(p) + (1 – y) * log(1 – p)]

ここで：

N の総数です。
y 実際のラベル（0または1）です。
p 正のクラス（1）の予測確率です。

The Log Loss value ranges from 0 to infinity, where 0 indicates perfect predictions (model outputs probabilities of either 0 or 1 correctly) and larger values indicate worse performance. A model that predicts probabilities close to the true labels will have a lower log loss, while a model that predicts probabilities far 真のラベルから外れる予測はより高いログ損失をもたらします。

Log Loss is particularly useful in scenarios where the output is not just a hard classification but a probability, making it suitable for applications such as ロジスティック回帰, neural networks, and other probabilistic classifiers. It is also widely 機械学習で使用される Kaggleなどのコンペティションでモデルの性能を評価するために。