AdapterHub

Task Adapters

Pre-trained models: bert, bart, xlm-roberta, distilbert, gpt2, roberta, mbart

MRPC

The Microsoft Research Paraphrase Corpus (MRPC) consists of sentence pairs automatically extracted from online news sources, with human annotations indicating whether the sentences in each pair are semantically equivalent.
  Website
sts/mrpc@ukp · roberta-base
1 version · Architecture: houlsby

MRPC adapter (with head) trained using the `run_glue.py` script with an extension that retains the best checkpoint (out of 30 epochs).
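The `sts/mrpc@ukp` identifier shown for each entry is what gets passed to the adapter loading API. As a minimal sketch, assuming the `adapter-transformers` library (class names such as `AutoAdapterModel` may differ across library versions):

```python
from transformers import AutoAdapterModel, AutoTokenizer

# adapter-transformers ships as a drop-in replacement for `transformers`
# and adds the adapter methods used below.
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoAdapterModel.from_pretrained("roberta-base")

# Pull the MRPC adapter (including its classification head) from AdapterHub.
adapter_name = model.load_adapter("sts/mrpc@ukp", source="ah")
model.set_active_adapters(adapter_name)

# MRPC is a sentence-pair task: score two sentences for semantic equivalence.
inputs = tokenizer("He said the food was good.",
                   "The man praised the meal.",
                   return_tensors="pt")
logits = model(**inputs).logits
```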

sts/mrpc@ukp · roberta-large
1 version · Architecture: houlsby

MRPC adapter (with head) trained using the `run_glue.py` script with an extension that retains the best checkpoint (out of 30 epochs).

sts/mrpc@ukp · distilbert-base-uncased
1 version · Architecture: pfeiffer

Adapter for distilbert-base-uncased in Pfeiffer architecture trained on the MRPC dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/mrpc@ukp · facebook/bart-base
1 version · Architecture: houlsby · Non-linearity: swish · Reduction factor: 16

Adapter for bart-base in Houlsby architecture trained on the MRPC dataset for 15 epochs with early stopping and a learning rate of 1e-4.
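The hyperparameters listed for this entry (Houlsby architecture, swish non-linearity, reduction factor 16) map directly onto an adapter configuration. A sketch of recreating that configuration for a new adapter, again assuming `adapter-transformers` and reusing a `model` loaded as above; the adapter name "mrpc" is an arbitrary choice for this sketch:

```python
from transformers import AdapterConfig

# Houlsby bottleneck config matching the listed hyperparameters:
# swish non-linearity and a reduction factor of 16.
config = AdapterConfig.load("houlsby",
                            non_linearity="swish",
                            reduction_factor=16)

# Add a fresh adapter under this config (e.g. to retrain on MRPC).
model.add_adapter("mrpc", config=config)
model.train_adapter("mrpc")  # freeze the base model, train only the adapter
```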

sts/mrpc@ukp · gpt2
1 version · Architecture: houlsby · Non-linearity: swish · Reduction factor: 16

Adapter for gpt2 in Houlsby architecture trained on the MRPC dataset for 10 epochs with a learning rate of 1e-4.

sts/mrpc@ukp · distilbert-base-uncased
1 version · Architecture: houlsby

Adapter for distilbert-base-uncased in Houlsby architecture trained on the MRPC dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/mrpc@ukp · facebook/bart-base
1 version · Architecture: pfeiffer · Non-linearity: relu · Reduction factor: 16

Adapter for bart-base in Pfeiffer architecture trained on the MRPC dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/mrpc@ukp · bert-base-uncased
1 version · Architecture: houlsby

Adapter in Houlsby architecture trained on the MRPC task for 20 epochs with early stopping and a learning rate of 1e-4. See https://arxiv.org/pdf/2007.07779.pdf.

sts/mrpc@ukp · roberta-base
1 version · Architecture: pfeiffer

Adapter in Pfeiffer architecture trained on the MRPC dataset.

sts/mrpc@ukp · bert-base-uncased
1 version · Architecture: pfeiffer

Adapter in Pfeiffer architecture trained on the MRPC task for 20 epochs with early stopping and a learning rate of 1e-4. See https://arxiv.org/pdf/2007.07779.pdf.

sts/mrpc@ukp · roberta-large
1 version · Architecture: pfeiffer

MRPC adapter (with head) trained using the `run_glue.py` script with an extension that retains the best checkpoint (out of 30 epochs).

sts/mrpc@ukp · gpt2
1 version · Architecture: pfeiffer · Non-linearity: relu · Reduction factor: 16

Adapter for gpt2 in Pfeiffer architecture trained on the MRPC dataset for 10 epochs with a learning rate of 1e-4.

AdapterHub/bert-base-uncased-pf-mrpc · bert-base-uncased
Hosted on huggingface.co

# Adapter `AdapterHub/bert-base-uncased-pf-mrpc` for bert-base-uncased

An [adapter](https://adapterhub.ml) for the `bert-base-uncased` model that was trained on the...

AdapterHub/roberta-base-pf-mrpc · roberta-base
Hosted on huggingface.co

# Adapter `AdapterHub/roberta-base-pf-mrpc` for roberta-base

An [adapter](https://adapterhub.ml) for the `roberta-base` model that was trained on the...
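These last two adapters are hosted on the Hugging Face Hub rather than in AdapterHub's own repository, so they are loaded with `source="hf"`. A minimal sketch, again assuming `adapter-transformers`:

```python
from transformers import AutoAdapterModel

model = AutoAdapterModel.from_pretrained("bert-base-uncased")

# source="hf" tells load_adapter to resolve the ID on huggingface.co
# instead of the AdapterHub index.
adapter_name = model.load_adapter("AdapterHub/bert-base-uncased-pf-mrpc",
                                  source="hf")
model.set_active_adapters(adapter_name)
```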
