The UKP Sentential Argument Mining Corpus includes 25,492 sentences over eight controversial topics. Each sentence was annotated via crowdsourcing as either a supporting argument, an attacking... Argument Mining
COPA (Gordon et al., 2012) is a causal commonsense reasoning task whose goal is to select the more plausible of two alternative sentences, given a premise sentence. Common Sense Reasoning
Cosmos QA is a large-scale dataset of 35,600 problems that require commonsense-based reading comprehension, formulated as multiple-choice questions. Common Sense Reasoning
To investigate question answering with prior knowledge, we present CommonsenseQA: a challenging new dataset for commonsense question answering. To capture common sense beyond associations, we... Common Sense Reasoning
HellaSwag is a new benchmark for commonsense NLI. Common Sense Reasoning
Social IQa is the first large-scale benchmark for commonsense reasoning about social situations. Social IQa contains 38,000 multiple choice questions for probing emotional and social intelligence... Common Sense Reasoning
WinoGrande is a new collection of 44k problems, inspired by the Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against the... Common Sense Reasoning
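COPA and the other multiple-choice sets above share a similar premise-plus-alternatives layout. A minimal sketch, assuming the Hugging Face `datasets` library (not part of this catalog), that loads COPA and prints one example:

```python
# Minimal sketch: inspect COPA's premise/alternatives layout with Hugging Face `datasets`.
from datasets import load_dataset

copa = load_dataset("super_glue", "copa", split="validation")
example = copa[0]
print(example["premise"])
print("choice1:", example["choice1"])
print("choice2:", example["choice2"])
print("question type:", example["question"])  # "cause" or "effect"
print("gold alternative:", example["label"])  # 0 -> choice1, 1 -> choice2
```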
Dependency parsing on Universal Dependencies English EWT. UD English EWT is a Gold Standard Universal Dependencies Corpus for English, built over the source material of the English Web Treebank... Dependency Parsing
Dependency relation classification on Universal Dependencies English EWT. UD English EWT is a Gold Standard Universal Dependencies Corpus for English, built over the source material of the English... Dependency Relation Classification
Arabic Dialect Detection classifies text into Modern Standard Arabic (MSA), Egyptian, Maghrebi (northwest Africa), Gulf, and Levantine. Dialect Detection
Grammarly's Yahoo Answers Formality Corpus (GYAFC) is the largest dataset for formality classification, containing a total of 110K informal / formal sentence pairs. This is a... Formality Classification
The First Certificate in English (FCE) dataset in sequence labeling format, converted by Rei and Yannakoudakis (2016), where each token in a sentence is labelled either as correct or as incorrect.... Grammatical Error Detection
The model is trained to learn the structure of poems in the English language. Language Modeling
The Corpus of Linguistic Acceptability (CoLA) in its full form consists of 10,657 sentences from 23 linguistics publications, expertly annotated for acceptability (grammaticality) by their original... Linguistic Acceptability
The Multi-Sentence Reading Comprehension dataset (MultiRC, Khashabi et al., 2018) is a true/false question-answering task. Each example consists of a context paragraph, a question about that... Machine Reading Comprehension
RACE is a large-scale reading comprehension dataset with more than 28,000 passages and nearly 100,000 questions. The dataset is collected from English examinations in China, which are designed for... Machine Reading Comprehension
ReCoRD is a reading comprehension task requiring commonsense reasoning. The context paragraphs and queries are generated from curated CNN/Daily Mail news articles. The task itself is formulated... Machine Reading Comprehension
The English-Romanian translation dataset from the shared task of the First Conference on Machine Translation (WMT16). Machine Translation
Enhances phrase-level cross-lingual entity alignment in language models; suitable for knowledge graph tasks. Multilingual Knowledge Integration
Enhances sentence-level cross-lingual entity alignment in language models; suitable for language modeling tasks. Multilingual Knowledge Integration
Enhances phrase-level factual triple knowledge in language models; suitable for knowledge graph tasks. Multilingual Knowledge Integration
Enhances sentence-level factual triple knowledge in language models; suitable for language modeling tasks. Multilingual Knowledge Integration
The CoNLL 2003 shared task is language-independent named entity recognition. The dataset contains English and German training, development, and test sets. The English data is from the Reuters corpus and... Named Entity Recognition
The MIT Movie Corpus contains movie-related queries tagged in BIO format. We use the larger trivia10k13 corpus, which contains more complex examples. Named Entity Recognition
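A rough sketch of how a pre-trained CoNLL-2003 NER adapter could be applied with the `adapters` package (the adapter identifier below is an assumption; check AdapterHub for the exact name):

```python
# Rough sketch: tag one sentence with a CoNLL-2003 NER adapter.
import torch
from transformers import AutoTokenizer
from adapters import AutoAdapterModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoAdapterModel.from_pretrained("bert-base-uncased")
ner = model.load_adapter("AdapterHub/bert-base-uncased-pf-conll2003")  # assumed identifier
model.set_active_adapters(ner)

inputs = tokenizer("John Smith works for Reuters in London .", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits   # shape: (1, sequence_length, num_labels)
print(logits.argmax(-1))              # per-token label indices in the BIO scheme
```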
The CommitmentBank is a corpus of naturally occurring discourses whose final sentence contains a clause-embedding predicate under an entailment canceling operator. Natural Language Inference
The Adversarial Natural Language Inference (ANLI) dataset is a new large-scale NLI benchmark. The dataset is collected via an... Natural Language Inference
Multi-Genre Natural Language Inference is a large-scale, crowdsourced entailment classification task. Given a pair of sentences, the goal is to predict whether the second sentence is an... Natural Language Inference
Question Natural Language Inference is a version of SQuAD which has been converted to a binary classification task. The positive examples are (question, sentence) pairs which do contain... Natural Language Inference
Recognizing Textual Entailment is a binary entailment task similar to MNLI, but with much less training data. Natural Language Inference
The SciTail dataset is an entailment dataset created from multiple-choice science exams and web sentences. Each question and the correct answer choice are converted into an assertive statement to... Natural Language Inference
The SICK relatedness (SICK-R) task trains a linear model to output a score from 1 to 5 indicating the relatedness of two sentences. The same dataset (SICK-E) can be treated as a three-class... Natural Language Inference
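All of the NLI tasks above reduce to classifying a premise/hypothesis pair. A rough sketch using an MNLI adapter with the `adapters` package (the adapter identifier is an assumption; check AdapterHub for the exact name):

```python
# Rough sketch: three-way NLI prediction with an MNLI adapter.
import torch
from transformers import AutoTokenizer
from adapters import AutoAdapterModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoAdapterModel.from_pretrained("bert-base-uncased")
nli = model.load_adapter("AdapterHub/bert-base-uncased-pf-mnli")  # assumed identifier
model.set_active_adapters(nli)

pair = tokenizer(
    "A soccer game with multiple males playing.",  # premise
    "Some men are playing a sport.",               # hypothesis
    return_tensors="pt",
)
with torch.no_grad():
    probs = model(**pair).logits.softmax(-1)       # three NLI classes
print(probs)
```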
POS-tagging using the part-of-speech tags provided by the English subset of the CoNLL 2003 shared task dataset for NER. Part-Of-Speech Tagging
English Web Treebank was developed by the Linguistic Data Consortium (LDC) with funding through a gift from Google Inc. It consists of over 250,000 words of English weblogs, newsgroups, email,... Part-Of-Speech Tagging
POS-tagging on Universal Dependencies English EWT. UD English EWT is a Gold Standard Universal Dependencies Corpus for English, built over the source material of the English Web Treebank... Part-Of-Speech Tagging
Text chunking was the shared task for CoNLL-2000. This data consists of the same partitions of the Wall Street Journal corpus (WSJ) as the widely used data for noun phrase chunking: sections 15-18... Phrase Chunking
The CoNLL-2003 shared task on language-independent named entity recognition is a corpus consisting of Reuters news stories published between August 1996 and August 1997. Each line of the corpus has... Phrase Chunking
The WMT21 shared task on quality estimation. Training language pairs: high-resource English--German (En-De) and English--Chinese (En-Zh), medium-resource Russian--English (Ru-En), Romanian--English... Quality Estimation
BoolQ is a reading comprehension dataset of naturally occurring yes/no questions, which turn out to be unexpectedly challenging. Question Answering
ComplexQuestions is a QA dataset originally created for querying knowledge bases. The questions are obtained from Google web queries and don't provide supporting context paragraphs in the original... Question Answering
NarrativeQA is a dataset of stories and corresponding questions designed to test reading comprehension. Question Answering
Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is... Question Answering
SQuAD2.0 combines the 100,000 questions in SQuAD1.1 with over 50,000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones. To do well on SQuAD2.0,... Question Answering
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs. DROP is a crowdsourced,... Question Answering
WikiHop is open-domain and based on Wikipedia articles; the goal is to recover Wikidata information by hopping through documents. Question Answering
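SQuAD-style extractive QA predicts an answer span inside the context. A rough sketch with a SQuAD adapter and the `adapters` package (the adapter identifier and the example text are assumptions):

```python
# Rough sketch: extract an answer span with a SQuAD-style QA adapter.
import torch
from transformers import AutoTokenizer
from adapters import AutoAdapterModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoAdapterModel.from_pretrained("bert-base-uncased")
qa = model.load_adapter("AdapterHub/bert-base-uncased-pf-squad")  # assumed identifier
model.set_active_adapters(qa)

question = "Who wrote the questions?"
context = "SQuAD consists of questions posed by crowdworkers on a set of Wikipedia articles."
inputs = tokenizer(question, context, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
start = int(outputs.start_logits.argmax())
end = int(outputs.end_logits.argmax())
print(tokenizer.decode(inputs["input_ids"][0][start:end + 1]))  # predicted answer span
```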
The Parallel Meaning Bank (PMB), developed at the University of Groningen and building upon the Groningen Meaning Bank, comprises sentences and texts in raw and tokenised format, syntactic... Semantic Tagging
Microsoft Research Paraphrase Corpus consists of sentence pairs automatically extracted from online news sources, with human annotations for whether the sentences in the pair are semantically equivalent. Semantic Textual Similarity
Quora Question Pairs is a binary classification task where the goal is to determine if two questions asked on Quora are semantically equivalent. Semantic Textual Similarity
StackExchange QA similarity determines whether two questions or a question-answer pair in StackExchange forums are related or not (e.g., to find duplicate questions or relevant answers). Semantic Textual Similarity
The Semantic Textual Similarity Benchmark is a collection of sentence pairs drawn from news headlines and other sources. They were annotated with a score from 1 to 5 denoting how similar the two... Semantic Textual Similarity
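A minimal sketch, assuming the Hugging Face `datasets` library, that loads the STS Benchmark via GLUE and prints one scored pair:

```python
# Minimal sketch: inspect one STS Benchmark pair and its graded similarity score.
from datasets import load_dataset

stsb = load_dataset("glue", "stsb", split="validation")
pair = stsb[0]
print(pair["sentence1"])
print(pair["sentence2"])
print("similarity score:", pair["label"])  # graded score; higher means more similar
```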
This dataset was released as part of SemEval-2020 Task 9 on Sentiment Analysis in Code-Mixed Social Media (Twitter) text. It tags positive, neutral and negative sentiment. There are 17,000... Sentiment Analysis
The IMDB dataset contains 50K movie reviews for natural language processing or text analytics. This is a dataset for binary sentiment classification containing substantially more data than previous... Sentiment Analysis
Movie Review Dataset. This is a dataset containing 5,331 positive and 5,331 negative processed sentences from Rotten Tomatoes movie reviews. This data was first used in Bo Pang and Lillian Lee,... Sentiment Analysis
The Stanford Sentiment Treebank is a binary single-sentence classification task consisting of sentences extracted from movie reviews with human annotations of their sentiment. Sentiment Analysis
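Single-sentence sentiment classification follows the same pattern as the pair tasks above, just without a second text segment. A rough sketch with an SST-2 adapter (the adapter identifier is an assumption):

```python
# Rough sketch: binary sentiment prediction with an SST-2 adapter.
import torch
from transformers import AutoTokenizer
from adapters import AutoAdapterModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoAdapterModel.from_pretrained("bert-base-uncased")
sst = model.load_adapter("AdapterHub/bert-base-uncased-pf-sst2")  # assumed identifier
model.set_active_adapters(sst)

inputs = tokenizer("A warm, funny, engaging film.", return_tensors="pt")
with torch.no_grad():
    probs = model(**inputs).logits.softmax(-1)   # two sentiment classes
print(probs)
```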
CNN/DailyMail summarization dataset. Summarization
Extreme Summarization (XSum) Dataset. Summarization
NER on Wikipedia Documents. Wiki Ann NER
The Word-in-Context dataset is a word sense disambiguation task in the form of a binary classification task. Each dataset example consists of two sentences and a polysemous word that appears in... Word Sense Disambiguation
Add your task to AdapterHub; it's super awesome!
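A minimal sketch of how a new task adapter could be set up, trained, and saved with the `adapters` package; the task name is a placeholder and the training loop is omitted:

```python
# Minimal sketch: add, train, and save a new task adapter for your own dataset.
from adapters import AutoAdapterModel

model = AutoAdapterModel.from_pretrained("bert-base-uncased")
model.add_adapter("my_task")                            # placeholder name for the new adapter
model.add_classification_head("my_task", num_labels=2)  # task-specific prediction head
model.train_adapter("my_task")                          # freeze the base model, train only the adapter

# ... train with adapters.AdapterTrainer or a regular PyTorch loop ...

model.save_adapter("./my_task_adapter", "my_task")      # saved weights can be shared on AdapterHub
```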