Article,

On the effect of dropping layers of pre-trained transformer models.

, , , and .
Comput. Speech Lang., (2023)

Meta data

Tags

Users

  • @dblp

Comments and Reviews