Manifold Mixup ist eine innovative Methode der Datenaugmentation technique used in training Deep Learning models, particularly neuronale Netze. This method enhances the model’s ability to generalize by mixing samples from the Trainingsdaten, thereby creating new synthetic training examples.
The core idea behind Manifold Mixup is to interpolate between two data points and their corresponding labels in a way that respects the structure of the data manifold. This is done by selecting two input samples, x1 and x2, and a mixing coefficient λ (where 0 ≤ λ ≤ 1). The new mixed input xmix is calculated as:
xmix = λ * x1 + (1 - λ) * x2
Similarly, the corresponding labels y1 and y2 are combined using the same mixing coefficient:
ymix = λ * y1 + (1 - λ) * y2
This approach allows the model to learn from not just the original samples but also from their combinations, leading to improved robustness and performance. By training on these mixed samples, the model is less likely to overfit to the training data and can better handle variations in real-world data.
Manifold Mixup ist besonders nützlich bei Aufgaben wie Bildklassifikation, der Verarbeitung natürlicher Sprache, and other domains where complex data structures are prevalent. It represents a shift in how we think about data augmentation, emphasizing the importance of the relationships between data points rather than treating them as isolated instances.