The Word-in-Context dataset is a word sense disambiguation task in the form of a binary classification task. Each dataset example consists of two sentences and a polysemous word that appears in both. The task then is to determine whether the word is used with the same sense in both sentences.