Edit on GitHub

model = AutoAdapterModel.from_pretrained("roberta-large")
config = AdapterConfig.load("pfeiffer")
model.load_adapter("nli/qnli@ukp", config=config)

Description

QNLI adapter (with head) trained using the `run_glue.py` script with an extension that retains the best checkpoint (out of 10 epochs).

Properties

Pre-trained model
roberta-large
Adapter type
Prediction Head
  Yes
Task
Natural Language Inference
Dataset

Architecture

Name
pfeiffer
Non-linearity
relu
Reduction factor
16
{
  "ln_after": false,
  "ln_before": false,
  "mh_adapter": false,
  "output_adapter": true,
  "adapter_residual_before_ln": false,
  "non_linearity": null,
  "original_ln_after": true,
  "original_ln_before": true,
  "reduction_factor": null,
  "residual_before_ln": true
}

Author

  Name
Andreas Rücklé
  GitHub
  Twitter

Versions

Identifier Comment Score Download
1 DEFAULT Achieves 94.56 Accuracy on the QNLI devset

Citations

Adapter
@article{pfeiffer2020AdapterHub,
    title={AdapterHub: A Framework for Adapting Transformers},
    author={Jonas Pfeiffer,
            Andreas R\"uckl\'{e},
            Clifton Poth,
            Aishwarya Kamath,
            Ivan Vuli\'{c},
            Sebastian Ruder,
            Kyunghyun Cho,
            Iryna Gurevych},
    journal={ArXiv},
    year={2020}
}
Architecture
@misc{pfeiffer2020adapterfusion,
  title={AdapterFusion: Non-Destructive Task Composition for Transfer Learning},
  author={Jonas Pfeiffer and Aishwarya Kamath and Andreas Rücklé and Kyunghyun Cho and Iryna Gurevych},
  year={2020},
  eprint={2005.00247},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
Task
@article{wang2018glue,
  title={Glue: A multi-task benchmark and analysis platform for natural language understanding},
  author={Wang, Alex and Singh, Amanpreet and Michael, Julian and Hill, Felix and Levy, Omer and Bowman, Samuel R},
  journal={arXiv preprint arXiv:1804.07461},
  year={2018}
}