Heterogeneous Bert

Published: June 27, 2023

We have implemented a neural architecture search and a super-network training framework for heterogeneous BERT models. Given the search space and a teacher model, the super-network is automatically trained and the network structures are evaluated using balanced Pareto sampling. Compared to traditional neural architecture search frameworks, our approach achieves higher accuracy, faster convergence for sub-models, and superior performance under the same structural configurations.

Share on

Twitter Facebook LinkedIn

Xukun Liu

Heterogeneous Bert

Share on

Leave a Comment