AdapterHub

Task Adapters


QQP

Quora Question Pairs is a binary classification task where the goal is to determine if two questions asked on Quora are semantically equivalent.
sts/qqp@ukp bert-base-uncased
1 version, Architecture: pfeiffer

Adapter in Pfeiffer architecture trained on the QQP task for 20 epochs with early stopping and a learning rate of 1e-4. See https://arxiv.org/pdf/2007.07779.pdf.

sts/qqp@ukp bert-base-uncased
1 version, Architecture: houlsby

Adapter in Houlsby architecture trained on the QQP task for 20 epochs with early stopping and a learning rate of 1e-4. See https://arxiv.org/pdf/2007.07779.pdf.

sts/qqp@ukp distilbert-base-uncased
1 version, Architecture: pfeiffer

Adapter for distilbert-base-uncased in Pfeiffer architecture trained on the QQP dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/qqp@ukp facebook/bart-base
1 version, Architecture: houlsby, non-linearity: swish, reduction factor: 16

Adapter for bart-base in Houlsby architecture trained on the QQP dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/qqp@ukp roberta-base
1 version, Architecture: pfeiffer

Adapter for roberta-base in Pfeiffer architecture trained on the QQP dataset.

sts/qqp@ukp gpt2
1 version, Architecture: pfeiffer, non-linearity: relu, reduction factor: 16

Adapter for gpt2 in Pfeiffer architecture trained on the QQP dataset for 10 epochs with a learning rate of 1e-4.

sts/qqp@ukp facebook/bart-base
1 version, Architecture: pfeiffer, non-linearity: relu, reduction factor: 16

Adapter for bart-base in Pfeiffer architecture trained on the QQP dataset for 15 epochs with early stopping and a learning rate of 1e-4.

sts/qqp@ukp gpt2
1 version, Architecture: houlsby, non-linearity: swish, reduction factor: 16

Adapter for gpt2 in Houlsby architecture trained on the QQP dataset for 10 epochs with a learning rate of 1e-4.

sts/qqp@ukp distilbert-base-uncased
1 version, Architecture: houlsby

Adapter for distilbert-base-uncased in Houlsby architecture trained on the QQP dataset for 15 epochs with early stopping and a learning rate of 1e-4.
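
Several entries above list a reduction factor of 16. In bottleneck adapters this is the ratio between the model's hidden size and the adapter's bottleneck size. The sketch below works through that arithmetic for gpt2 and gives a rough parameter estimate. The function names are hypothetical, the hidden size (768) and layer count (12) are the standard gpt2 values rather than figures from this page, and the estimate counts only the down and up projections with their biases, so the real checkpoints may differ slightly.

```python
def bottleneck_size(hidden_size: int, reduction_factor: int) -> int:
    """Bottleneck width of one adapter: hidden size divided by the reduction factor."""
    return hidden_size // reduction_factor


def adapter_params(hidden_size: int, reduction_factor: int,
                   num_layers: int, adapters_per_layer: int) -> int:
    """Rough parameter count: down-projection + up-projection, each with a bias."""
    m = bottleneck_size(hidden_size, reduction_factor)
    per_adapter = 2 * hidden_size * m + hidden_size + m
    return num_layers * adapters_per_layer * per_adapter


# Assumed gpt2 dimensions (not stated on this page): 768 hidden units, 12 layers.
HIDDEN, LAYERS = 768, 12

# Pfeiffer places one adapter per layer; Houlsby places two.
pfeiffer_estimate = adapter_params(HIDDEN, 16, LAYERS, adapters_per_layer=1)
houlsby_estimate = adapter_params(HIDDEN, 16, LAYERS, adapters_per_layer=2)

print(bottleneck_size(HIDDEN, 16))  # 48
print(pfeiffer_estimate, houlsby_estimate)
```

Under these assumptions the Houlsby variant carries about twice as many adapter parameters as the Pfeiffer variant at the same reduction factor.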

AdapterHub/bert-base-uncased-pf-qqp bert-base-uncased
huggingface.co

# Adapter `AdapterHub/bert-base-uncased-pf-qqp` for bert-base-uncased An [adapter](https://adapterhub.ml) for the `bert-base-uncased` model that was trained on the...

AdapterHub/roberta-base-pf-qqp roberta-base
huggingface.co

# Adapter `AdapterHub/roberta-base-pf-qqp` for roberta-base An [adapter](https://adapterhub.ml) for the `roberta-base` model that was trained on the...
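
The two huggingface.co entries above can typically be loaded with the `adapters` Python library. The following is a minimal sketch, assuming that library is installed and that its `AutoAdapterModel.from_pretrained`, `load_adapter(..., source="hf")`, and `active_adapters` interfaces behave as in recent releases; the helper name `load_qqp_adapter` and the dictionary `HF_QQP_ADAPTERS` are hypothetical and not part of the listing.

```python
# Hub identifiers copied from the two huggingface.co entries in this listing.
HF_QQP_ADAPTERS = {
    "bert-base-uncased": "AdapterHub/bert-base-uncased-pf-qqp",
    "roberta-base": "AdapterHub/roberta-base-pf-qqp",
}


def load_qqp_adapter(base_model: str):
    """Load a base model and activate its QQP adapter (downloads weights on first use)."""
    # Imported lazily so this module can be imported without the library installed.
    # Assumption: the `adapters` package is available (`pip install adapters`).
    from adapters import AutoAdapterModel

    model = AutoAdapterModel.from_pretrained(base_model)
    # `source="hf"` asks the library to fetch the adapter from the Hugging Face Hub.
    adapter_name = model.load_adapter(HF_QQP_ADAPTERS[base_model], source="hf")
    # Activate the adapter so it is used in subsequent forward passes.
    model.active_adapters = adapter_name
    return model
```

Calling `load_qqp_adapter("roberta-base")` would return a model with the QQP adapter active. Whether a prediction head is loaded along with the adapter depends on the checkpoint, so check the model card before relying on the output labels.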
