Linguistics

Explore 21 AI terms in Linguistics

CoLA

CoLA

CoLA stands for the Corpus of Linguistic Acceptability, a dataset for evaluating linguistic models.

Computational humor

CH

Computational humor involves using algorithms to generate or understand jokes and humor.

Computational linguistics

CL

Computational linguistics is the study of using computer algorithms to process and analyze human language.

Constructed language

Conlang

A constructed language is an artificially created language designed for specific purposes, such as communication, art, or experimentation.

Corpus Linguistics

Corpus linguistics is the study of language through large collections of texts, known as corpora.

Dialogue Act

DA

A dialogue act is a communicative function of a segment of conversation, indicating the speaker's intention.

Digraph

A digraph is a pair of letters used together to represent a single sound or phoneme.

Europarl Corpus

EPC

The Europarl Corpus is a multilingual dataset of proceedings from the European Parliament, useful for language processing tasks.

Glyph

A glyph is a visual symbol representing a character or concept in writing or graphic design.

Lexical Diversity

LD

Lexical diversity measures the range of unique words used in a text or speech relative to the total number of words.

Linguistic Analysis

Linguistic Analysis involves examining language structure, meaning, and use to understand communication patterns.

Linguistic Feature

Linguistic features are characteristics of language that influence communication and understanding.

Linguistic Processing

Linguistic Processing involves analyzing and understanding human language through computational methods.

Linguistic Rule

Linguistic rules are formalized guidelines that dictate the structure and use of language.

Morpheme Segmentation

Morpheme segmentation is the process of breaking down words into their smallest meaningful units, called morphemes.

Natural Language Processing

NLP

Natural Language Processing (NLP) enables computers to understand, interpret, and respond to human language.

OntoNotes

ON

OntoNotes is a large-scale annotated corpus used in natural language processing tasks.

OPUS Corpus

OPUS

OPUS Corpus is a collection of multilingual parallel corpora used for natural language processing tasks.

Polysemanticity

Polysem.

Polysemanticity refers to a word or phrase having multiple meanings or interpretations.

Reading Comprehension

RC

Reading comprehension is the ability to understand and interpret written text.

Vocabulary

Vocabulary refers to the set of words known and used by individuals or groups.

Back to All Terms
Ctrl + /