nli/multinli@kabirahuja2431
bert-base-multilingual-cased
2 versions
Architecture: pfeiffer
Non-linearity: relu
Reduction factor: 2
Head:
A Pfeiffer adapter for the NLI task, stacked on top of a language adapter. Trained on the English MultiNLI data for 5 epochs with a batch size of 64. Version 2 performs better for cross-lingual transfer.
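
To make the configuration above concrete, here is a minimal pure-Python sketch of what a bottleneck adapter with reduction factor 2 and a ReLU non-linearity computes. It is an illustration under stated assumptions, not the code used to train this adapter: the hidden size of 768 is that of bert-base-multilingual-cased, the function names are hypothetical, and the parameter count covers only the down- and up-projection (any layer norm or other components are ignored).

```python
# Illustrative sketch only: names below are hypothetical, not a library API.

def bottleneck_size(hidden_size, reduction_factor):
    """Adapter bottleneck width = hidden size divided by the reduction factor."""
    return hidden_size // reduction_factor


def adapter_param_count(hidden_size, reduction_factor):
    """Weights and biases of the down- and up-projection for one adapter layer."""
    m = bottleneck_size(hidden_size, reduction_factor)
    down = hidden_size * m + m            # W_down and b_down
    up = m * hidden_size + hidden_size    # W_up and b_up
    return down + up


def relu(v):
    return [max(0.0, x) for x in v]


def affine(weights, v, bias):
    """Compute weights @ v + bias for a matrix given as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + b
            for row, b in zip(weights, bias)]


def adapter_forward(h, w_down, b_down, w_up, b_up):
    """Down-project, apply ReLU, up-project, then add the residual input."""
    z = relu(affine(w_down, h, b_down))
    out = affine(w_up, z, b_up)
    return [o + x for o, x in zip(out, h)]


HIDDEN_SIZE = 768  # assumption: hidden size of bert-base-multilingual-cased
print(bottleneck_size(HIDDEN_SIZE, 2))
print(adapter_param_count(HIDDEN_SIZE, 2))
```

With a reduction factor of 2 the bottleneck is half the hidden size, so this adapter is comparatively large; a higher reduction factor would give a narrower bottleneck and fewer parameters.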