Part of a dataset used for assessing a models performance (via doing predictions and determining statistical metrics on them.) Should never be used for training the model.
See also Training set.