AI Glossary: AI Metrics Terms & Definitions

Absolute Error

AE

Absolute Error measures the difference between a predicted value and the actual value, indicating the accuracy of a model.

Baseline Accuracy

Baseline accuracy is the minimum accuracy a model must achieve to be considered effective.

Benchmark Saturation

Benchmark saturation refers to the point at which adding more benchmarks does not yield significant improvements in performance assessment.

CLIP Score

CLIP

CLIP Score measures the alignment between images and text based on AI models, aiding in evaluating visual and textual content.

Cosine Distance

Cosine Distance measures similarity between two vectors as the cosine of the angle between them.

Divergence Metric

A divergence metric quantifies the difference between two probability distributions in machine learning.

Error Rate

Error Rate measures the frequency of incorrect predictions made by an AI model compared to the total predictions.

Evaluating AI

Evaluating AI involves assessing AI systems to ensure effectiveness, accuracy, and alignment with intended goals.

Goodness of Fit

Goodness of Fit measures how well a statistical model aligns with observed data.

Human Baseline

Human Baseline refers to the standard performance level of humans used for evaluating AI systems.

Inception Score

IS

Inception Score measures the quality of generated images based on their clarity and diversity.

Macro-Average

Macro-Average calculates the overall performance of a model across multiple classes in classification tasks.

Mean Average Precision

MAP

Mean Average Precision (MAP) measures the accuracy of ranked retrieval results in information retrieval systems.

Model Assessment

Model Assessment evaluates the performance and reliability of machine learning models.

Model Diagnostics

Model diagnostics assess the performance and reliability of AI models using various metrics and techniques.

Model Evaluation

Model evaluation assesses the performance of AI models using various metrics and techniques.

Model Metric

Model Metric refers to quantifiable measures used to assess the performance of AI models.

Model Score

A model score quantifies the performance of an AI model on a specific task, often using metrics like accuracy or F1-score.

Model Statistics

Model Statistics refer to key metrics used to evaluate AI models' performance and effectiveness.

NDCG Metric

NDCG

NDCG is a metric for evaluating the effectiveness of information retrieval systems based on the graded relevance of retrieved items.

Negative Predictive Value

NPV

Negative Predictive Value (NPV) measures the accuracy of a test in identifying negative cases.

Normalized Discounted Cumulative Gain

NDCG

Normalized Discounted Cumulative Gain (NDCG) measures the effectiveness of ranked retrieval results.

Normalized Frequency

Normalized frequency is a statistical measure used to compare data distributions relative to a total count.

Normalized Output

Normalized Output refers to the adjusted values produced by AI models to improve consistency and comparability.

Objective Measure

Objective measures quantify performance or outcomes based on unbiased data, ensuring consistency and comparability.

Online Metric

Online metrics are performance indicators used to evaluate the effectiveness of online activities.

Optimization Metric

An optimization metric is a quantitative measure used to assess the performance of algorithms or models in AI optimization tasks.

Out-of-Sample Error

Out-of-sample error measures model performance on unseen data, indicating generalization ability beyond training data.