ht/wiki@ukp
bert-base-multilingual-cased
2 versions
Architecture: pfeiffer
non-linearity: gelu
reduction factor: 2
Head:
Pfeiffer Adapter trained with Masked Language Modelling on Haitian Creole Wikipedia Articles for 50k steps and a batch size of 64.