Pre-trained model:
Pfeiffer Adapter trained with Masked Language Modelling on English Wikipedia Articles for 250k steps and a batch size of 64.