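# Assumes the `adapters` package; older adapter-transformers releases export these names from `transformers`.
from adapters import AdapterConfig, AutoAdapterModel

model = AutoAdapterModel.from_pretrained("gpt2")
config = AdapterConfig.load("pfeiffer", non_linearity="relu", reduction_factor=16)
model.load_adapter("sts/sts-b@ukp", config=config)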

Description

Adapter for gpt2 in the Pfeiffer architecture, trained on the STS-B dataset for 10 epochs with a learning rate of 1e-4.

Properties

Pre-trained model
gpt2
Adapter type
Prediction Head
Yes
Task
Semantic Textual Similarity
Dataset

Architecture

Name
pfeiffer
Non-linearity
relu
Reduction factor
16
{
  "ln_after": false,
  "ln_before": false,
  "mh_adapter": false,
  "output_adapter": true,
  "adapter_residual_before_ln": false,
  "non_linearity": "relu",
  "original_ln_after": true,
  "original_ln_before": true,
  "reduction_factor": 16,
  "residual_before_ln": true
}
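
To make the numbers in this configuration concrete, the following is a minimal sketch in plain Python of what a reduction factor of 16 implies for gpt2. It assumes gpt2's hidden size of 768 and its 12 transformer layers, and it assumes both projections carry bias terms. The helper names (`bottleneck_size`, `adapter_param_count`, `bottleneck_forward`) are illustrative and not part of any library, and layer-norm placement (the `residual_before_ln` and `original_ln_*` flags) is deliberately left out.

```python
# Illustrative sketch only; not the library's implementation.
HIDDEN_SIZE = 768        # gpt2 hidden size (assumption stated in the lead-in)
NUM_LAYERS = 12          # gpt2 transformer layers (assumption)
REDUCTION_FACTOR = 16    # "reduction_factor" from the configuration


def bottleneck_size(hidden_size, reduction_factor):
    """Width of the adapter's down-projected representation."""
    return hidden_size // reduction_factor


def adapter_param_count(hidden_size, reduction_factor):
    """Parameters of one bottleneck adapter, assuming biased linear layers."""
    d = bottleneck_size(hidden_size, reduction_factor)
    down = hidden_size * d + d            # down-projection weights + bias
    up = d * hidden_size + hidden_size    # up-projection weights + bias
    return down + up


def relu(v):
    """Element-wise ReLU, matching "non_linearity": "relu"."""
    return [max(0.0, x) for x in v]


def linear(v, weight, bias):
    """Apply a linear map; weight is a list of output rows of length len(v)."""
    return [sum(w * x for w, x in zip(row, v)) + b
            for row, b in zip(weight, bias)]


def bottleneck_forward(h, w_down, b_down, w_up, b_up):
    """Down-project, apply ReLU, up-project, then add the residual input."""
    z = relu(linear(h, w_down, b_down))
    out = linear(z, w_up, b_up)
    return [o + r for o, r in zip(out, h)]


per_layer = adapter_param_count(HIDDEN_SIZE, REDUCTION_FACTOR)
# One adapter per layer: "output_adapter" is true and "mh_adapter" is false.
total = per_layer * NUM_LAYERS
print(bottleneck_size(HIDDEN_SIZE, REDUCTION_FACTOR), per_layer, total)
```

Under these assumptions the bottleneck is 48 units wide, which is why the adapter adds only a small number of trainable weights compared with the frozen gpt2 backbone. The count excludes the prediction head.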

Author

Name
Hannah Sterz
Twitter

Versions

Identifier Comment Score Download
1 DEFAULT

Citations

Architecture
@misc{pfeiffer2020adapterfusion,
  title={AdapterFusion: Non-Destructive Task Composition for Transfer Learning},
  author={Jonas Pfeiffer and Aishwarya Kamath and Andreas Rücklé and Kyunghyun Cho and Iryna Gurevych},
  year={2020},
  eprint={2005.00247},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
Task
@article{cer2017semeval,
  title={SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation},
  author={Cer, Daniel and Diab, Mona and Agirre, Eneko and Lopez-Gazpio, Inigo and Specia, Lucia},
  journal={arXiv preprint arXiv:1708.00055},
  year={2017}
}