As your paper says, you have tried Linformer attention with smaller K. Could you please release the pretrained models? I'm very curious about that.
As your paper says, you have tried Linformer attention with smaller K. Could you please release the pretrained models? I'm very curious about that.