AI Glossary: What Is Causal Language Model (CLM)? Definition & Meaning

Modèle de langage causal

Un Causal Modèle de langage (CLM) est un type de intelligence artificielle model that generates text by predicting the next word in a sequence based solely on the words that have come before it. This process is known as autoregression, where each word is generated one at a time, and each new word is dependent on the words that precede it.

CLMs are trained on large datasets of text, learning the statistical relationships between words and the context in which they appear. During training, the model learns to understand language patterns, grammar, and even some levels of semantics. Once trained, the model can generate coherent and contextually relevant text, making it useful for various applications, including chatbots, content generation, and tâches de traitement du langage naturel.

One key feature of causal language models is that they operate in a unidirectional manner. This means that when predicting the next word, they only consider the words to the left (or before) in the sequence, unlike other models such as bidirectional transformers that take into account the entire context. This characteristic allows CLMs to excel in tasks where the order and structure of the text matter, such as story generation or systèmes de dialogue.

Popular examples of Causal Language Models include OpenAI’s GPT (Transformateur pré-entraîné génératif) series. These models showcase the effectiveness of CLMs in generating human-like text, and they have been fine-tuned for various specific tasks, further enhancing their capabilities.