Submitted by seraphaplaca2 t3_122fj05 in MachineLearning
_Arsenie_Boca_ t1_jdqy1n8 wrote
Reply to comment by Co0k1eGal3xy in Is it possible to merge transformers? [D] by seraphaplaca2
Merging model outputs also means you have to run both models at inference time. I think the best option is to merge the weights and then recover the lost performance by training on datasets from both domains, with distillation from the respective expert model for each domain.
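
A rough sketch of that idea in plain Python, for illustration only. It assumes both models share the same architecture, so their parameters can be interpolated elementwise, and it uses simple linear interpolation as the merge, which is only one possible choice. The names `merge_weights` and `distillation_loss`, the `alpha` mixing factor, and the temperature are hypothetical and not taken from any library.

```python
import math


def merge_weights(state_a, state_b, alpha=0.5):
    """Linearly interpolate two parameter dicts (name -> flat list of floats).

    Assumes both models have identical parameter names and sizes.
    """
    if state_a.keys() != state_b.keys():
        raise ValueError("models must share the same parameter names")
    merged = {}
    for name in state_a:
        a, b = state_a[name], state_b[name]
        if len(a) != len(b):
            raise ValueError(f"size mismatch for parameter {name!r}")
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_sum = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_sum for z in scaled]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened distributions."""
    log_p = log_softmax(teacher_logits, temperature)  # expert (teacher)
    log_q = log_softmax(student_logits, temperature)  # merged model (student)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


# Toy usage: merge two "models", then score the merged model's logits
# against the expert for whichever domain the batch came from.
merged = merge_weights({"w": [1.0, 3.0]}, {"w": [3.0, 5.0]})
loss = distillation_loss([0.2, 1.1, -0.3], [0.0, 2.0, -1.0])
```

In an actual training loop, each batch from domain A would be distilled against expert A and each batch from domain B against expert B, with the merged model as the student and the distillation term typically combined with the ordinary task loss.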