Pre-trained model:
Houlsby adapter trained with masked language modelling (MLM) on Estonian Wikipedia articles for 250k steps with a batch size of 64.
Pfeiffer adapter trained with masked language modelling (MLM) on Estonian Wikipedia articles for 250k steps with a batch size of 64.
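
To put the two training runs side by side, the following minimal sketch works out how many sequences each adapter saw and how many adapter modules each configuration inserts. It is an illustration only: the `AdapterRun` class is hypothetical, the 12-layer base model is an assumed example (the base model is not stated here), and the sequence count assumes that 64 is the effective batch size per step. The adapters-per-layer values follow the usual definitions: Houlsby places an adapter after both the attention and the feed-forward sub-layer, Pfeiffer only after the feed-forward sub-layer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AdapterRun:
    """Hypothetical summary of one adapter training run (not a library API)."""
    name: str
    adapters_per_layer: int  # Houlsby: 2 per transformer layer, Pfeiffer: 1
    steps: int = 250_000     # from the description above
    batch_size: int = 64     # from the description above

    def sequences_seen(self) -> int:
        # Assumes one batch of `batch_size` sequences per optimisation step.
        return self.steps * self.batch_size

    def total_adapter_modules(self, num_layers: int) -> int:
        # `num_layers` is an assumption about the base model, not a given.
        return self.adapters_per_layer * num_layers


houlsby = AdapterRun("houlsby", adapters_per_layer=2)
pfeiffer = AdapterRun("pfeiffer", adapters_per_layer=1)

for run in (houlsby, pfeiffer):
    # num_layers=12 is an assumed example for a base-size transformer.
    print(run.name, run.sequences_seen(), run.total_adapter_modules(num_layers=12))
```

Under these assumptions both runs process 16 million sequences, while the Houlsby configuration adds twice as many adapter modules as the Pfeiffer one.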