sum/xsum@ukp
facebook/bart-large
1 version
Architecture: houlsby
non-linearity: swish
reduction factor: 2
Head:
Adapter for bart-large in Pfeiffer architecture trained on the XSum dataset for 10 epochs with early stopping and a learning rate of 1e-4.