Pre-trained model:
Houlsby adapter trained with masked language modelling on French Wikipedia articles for 250k steps with a batch size of 64.
Pfeiffer adapter trained with masked language modelling on French Wikipedia articles for 250k steps with a batch size of 64.
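
For a rough sense of scale, the stated training budget can be converted into a count of sequences processed per adapter. This is a minimal sketch that assumes the batch size is counted in sequences and that no gradient accumulation was used; neither is stated here, and the function name `sequences_seen` is hypothetical.

```python
def sequences_seen(steps: int, batch_size: int) -> int:
    """Training sequences processed, assuming exactly one batch per step."""
    return steps * batch_size


STEPS = 250_000   # "250k steps" from the description
BATCH_SIZE = 64   # batch size from the description

total = sequences_seen(STEPS, BATCH_SIZE)
print(f"{total:,} sequences per adapter")  # 16,000,000 sequences per adapter
```

This figure counts repeated sequences if the corpus was traversed more than once; the number of epochs is not given.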