mt/wmt16_en_ro@ukp
facebook/mbart-large-cc25
1 version
Architecture: pfeiffer
non-linearity: relu
reduction factor: 2
Head:
Adapter for mbart-large-cc25 in Pfeiffer architecture with reduction factor 2 trained on the WMT16 Romanian-English translation task.
Training for 10 epochs with early stopping and a learning rate...