Pre-trained model:
Pfeiffer Adapter trained with Masked Language Modelling on English Wikipedia Articles for 250k steps and a batch size of 64.
Houlsby Adapter trained with Masked Language Modelling on English Wikipedia Articles for 250k steps and a batch size of 64.