Esquecimento Catastrófico is a phenomenon observed in inteligência artificial, particularly in aprendizado de máquina models, where a model forgets previously learned information upon learning novos dados or tasks. This issue is especially prominent in neural networks that are trained sequentially on different tasks, leading to a decline in performance on earlier tasks.
O termo foi descrito pela primeira vez no contexto de psicologia teoria de aprendizagem, but it has significant implications in AI and machine learning. When a model is trained on one task and then retrained on a different task, it often fails to retain the knowledge acquired from the first task, resulting in poor performance on it. This is problematic for applications where continuous learning and adaptation are required.
Catastrophic forgetting is commonly attributed to the way neural networks update their weights during training. When new data is introduced, the adjustments made to the network’s parameters can interfere with the previously established representations, causing them to degrade or be overwritten. As a result, the model exhibits a high degree of sensitivity to the order in which tasks are presented.
Para mitigar o esquecimento catastrófico, pesquisadores têm explorado várias técnicas, incluindo:
- Regularização métodos: These help maintain the weights associated with previous tasks while allowing new learning.
- Redes com memória aumentada: These incorporate componentes de memória externa estruturas para armazenar informações de tarefas anteriores.
- Redes neurais progressivas: These architectures allocate separate resources for each task, preserving knowledge from earlier tasks.
Compreender e abordar o esquecimento catastrófico é crucial para o desenvolvimento sistemas de IA that can learn continuously and adapt to new information without losing valuable knowledge.