Cosine Distance is a metric used to determine the similarity between two non-zero vectors in an producto interno space. It is defined as one minus the cosine of the angle between the vectors. This measure is particularly useful in various fields like análisis de texto, sistemas de recomendación, and aprendizaje automático, where the orientation of the data points is more significant than their magnitude.
Matemáticamente, la distancia coseno puede expresarse como:
Cosine Distance(A, B) = 1 – (A • B) / (||A|| ||B||)
Donde:
- A • B is the producto punto de vectores A y B.
- ||A|| and ||B|| son las magnitudes (o longitudes) de los vectores A y B, respectivamente.
El valor de la distancia coseno varía de 0 a 2. Una distancia coseno de 0 indica que los dos vectores son idénticos en dirección, mientras que un valor de 1 indica que los vectores son ortogonales (completamente disímiles).
Cosine distance is particularly effective for high-dimensional data, such as text represented as word vectors in procesamiento de lenguaje natural. In such cases, it helps in identifying how similar two documents are based on the context of the words used, rather than their frequency, making it a robust measure for various AI applications.