AI Glossary: What Is Beam Search Decoding? Definition & Meaning

Beam Search Decoding is a heuristic search algorithm commonly used in natural language processing (NLP) and machine translation to generate sequences of outputs, such as sentences or translations. Unlike exhaustive search methods that explore all possible sequences, beam search balances efficiency and accuracy by maintaining a fixed number of the best candidates at each step, known as the ‘beam width’.

The algorithm starts with an initial state and iteratively expands candidates by considering possible next steps. At each iteration, it evaluates all potential continuations of the current sequences and keeps only the top ‘k’ sequences based on their likelihood scores, where ‘k’ is the beam width. This process continues until a stopping criterion is met, such as generating a specific token denoting the end of the sequence.

The primary advantage of beam search is that it reduces the computational load compared to exhaustive search methods while still providing robust and relevant output sequences. However, it also has some limitations: if the beam width is too narrow, it may miss the optimal sequence; conversely, if the beam width is too wide, it may lead to increased computation without significant gains in output quality. Thus, selecting an appropriate beam width is crucial for balancing performance and efficiency.

Beam search is widely employed in various AI applications, including speech recognition, text generation, and machine translation, where generating coherent and contextually relevant sequences is critical.