Unidade Recorrente Gated (GRU)
Uma Unidade Recorrente com Portão (GRU) é um tipo especializado de rede neural recorrente (RNN) architecture designed to handle sequential data more effectively. It was introduced by Kyunghyun Cho et al. in 2014 as a simpler alternative to the Memória de Longo Prazo redes (LSTM).
Os GRUs são particularmente úteis em tarefas que envolvem previsão de séries temporais, processamento de linguagem natural, and other applications where data is ordered in sequences. The key innovation of GRUs is their use of gating mechanisms that help the network learn which information to keep or discard as it processes the input sequence.
Existem dois gates principais em um GRU:
- Gate de Atualização: This gate determines how much of the past information needs to be passed along to the future. It controls the flow of information from the previous time step to the current time step, helping the model retain relevant context.
- Gate de Reset: This gate decides how much of the past information to forget. It allows the model to reset its memory when processing new inputs, making it flexible and efficient in learning temporal dependencies.
One of the advantages of GRUs compared to LSTMs is their simpler architecture, which generally leads to faster training times and lower computational costs. Despite this, GRUs are often found to perform similarly to LSTMs in various tasks, making them a popular choice in aplicações de aprendizado profundo.
In summary, GRUs are powerful tools for handling sequential data, providing a balance between complexity and performance, and are widely used in modern aplicações de IA.