sum/cnn_dailymail@ukp
facebook/bart-large
1 version
Architecture: pfeiffer
non-linearity: relu
reduction factor: 2
Head:
Adapter for bart-large in the Pfeiffer architecture, trained on the CNN/DailyMail dataset for 10 epochs with early stopping and a learning rate of 1e-4.
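
To make the configuration above concrete, here is a minimal sketch of what a reduction factor of 2 with a relu non-linearity implies for a bottleneck adapter. It is an illustration in plain Python, not the library's implementation. Assumptions: the hidden size of bart-large is taken to be 1024, the parameter count covers only the down- and up-projection weights and biases of a single adapter module, and the function names are hypothetical.

```python
# Illustrative sketch only: names and structure are assumptions, not the adapter library's API.

HIDDEN_SIZE = 1024      # assumed hidden size (d_model) of facebook/bart-large
REDUCTION_FACTOR = 2    # from the configuration listed above


def bottleneck_size(hidden_size: int, reduction_factor: int) -> int:
    """Width of the adapter bottleneck: hidden size divided by the reduction factor."""
    return hidden_size // reduction_factor


def adapter_params(hidden_size: int, reduction_factor: int) -> int:
    """Weights plus biases of the down- and up-projection in one adapter module."""
    b = bottleneck_size(hidden_size, reduction_factor)
    down = hidden_size * b + b            # down-projection: hidden -> bottleneck
    up = b * hidden_size + hidden_size    # up-projection: bottleneck -> hidden
    return down + up


def adapter_forward(x, w_down, b_down, w_up, b_up):
    """Bottleneck adapter pass for one vector: down-project, relu, up-project, add residual."""
    hidden = [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + bias)  # relu non-linearity
        for row, bias in zip(w_down, b_down)
    ]
    out = [
        sum(w * hi for w, hi in zip(row, hidden)) + bias
        for row, bias in zip(w_up, b_up)
    ]
    return [xi + oi for xi, oi in zip(x, out)]  # residual connection


if __name__ == "__main__":
    print(bottleneck_size(HIDDEN_SIZE, REDUCTION_FACTOR))
    print(adapter_params(HIDDEN_SIZE, REDUCTION_FACTOR))
```

Under these assumptions the bottleneck is half as wide as the model's hidden state, so each adapter module is relatively large compared with adapters that use higher reduction factors such as 16.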