AI Glossary: What Is Feature Squeezing? Definition & Meaning

Feature squeezing is a technique used in the realm of adversarial machine learning to enhance the robustness of machine learning models against adversarial attacks. The core idea behind feature squeezing is to simplify the input data by reducing its dimensionality or complexity, which can help in mitigating the effects of small perturbations that adversaries might introduce to mislead the model.

This method typically involves two main strategies: bit depth reduction and spatial smoothing. Bit depth reduction refers to decreasing the number of bits used to represent each pixel in an image (for example, from 8 bits to 4 bits), which effectively reduces the number of distinct values that can be assigned to each pixel. This in turn minimizes the potential for subtle changes that could confuse the model. Spatial smoothing, on the other hand, applies techniques such as averaging or Gaussian smoothing to the input data, which can help in removing noise and irrelevant details that adversaries might exploit.

By applying feature squeezing, the model’s sensitivity to adversarial perturbations can be significantly reduced, leading to improved performance in real-world applications where data might be subject to manipulation or noise. However, it is important to balance the degree of feature squeezing with potential loss of information, as overly aggressive squeezing may lead to a deterioration in the model’s overall performance on legitimate inputs. As such, feature squeezing is often used in conjunction with other defense mechanisms to provide a more comprehensive approach to enhancing model security.