AdapterHub - ukp/roberta-large-sts_b

Edit on GitHub

model = AutoAdapterModel.from_pretrained("roberta-large")
config = AdapterConfig.load("houlsby")
model.load_adapter("sts/sts-b@ukp", config=config)

Description

STS-B adapter (with head) trained using the `run_glue.py` script with an extension that retains the best checkpoint (out of 30 epochs).

Properties

Pre-trained model

roberta-large

Adapter type

Task

Prediction Head

Yes

Task

Semantic Textual Similarity

Dataset

STS-B

Architecture

Name

houlsby

Non-linearity

swish

Reduction factor

{
  "ln_after": false,
  "ln_before": false,
  "mh_adapter": true,
  "output_adapter": true,
  "adapter_residual_before_ln": false,
  "non_linearity": null,
  "original_ln_after": true,
  "original_ln_before": false,
  "reduction_factor": null,
  "residual_before_ln": true
}

Author

Name

Andreas Rücklé

E-Mail

rueckle@ukp.informatik.tu-darmstadt

Web

http://rueckle.net

GitHub

arueckle

Twitter

@arueckle

Versions

Identifier	Comment	Score	Download
1 DEFAULT	Achieves 92.32 Spearman rank correlation on the STS-Benchmark (devset)

Citations

Adapter

@article{pfeiffer2020AdapterHub,
    title={AdapterHub: A Framework for Adapting Transformers},
    author={Jonas Pfeiffer,
            Andreas R\"uckl\'{e},
            Clifton Poth,
            Aishwarya Kamath,
            Ivan Vuli\'{c},
            Sebastian Ruder,
            Kyunghyun Cho,
            Iryna Gurevych},
    journal={ArXiv},
    year={2020}
}

Architecture

@misc{houlsby2019parameterefficient,
  title={Parameter-Efficient Transfer Learning for NLP},
  author={Neil Houlsby and Andrei Giurgiu and Stanislaw Jastrzebski and Bruna Morrone and Quentin de Laroussilhe and Andrea Gesmundo and Mona Attariyan and Sylvain Gelly},
  year={2019},
  eprint={1902.00751},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}

Task

@article{cer2017semeval,
  title={Semeval-2017 task 1: Semantic textual similarity-multilingual and cross-lingual focused evaluation},
  author={Cer, Daniel and Diab, Mona and Agirre, Eneko and Lopez-Gazpio, Inigo and Specia, Lucia},
  journal={arXiv preprint arXiv:1708.00055},
  year={2017}
}

Adapter for STS-B

ukp / roberta-large-sts_b_houlsby

Description

Properties

Architecture

Configuration

Author

Versions

Citations

BibTeX

Adapter

Architecture

Task