Submitted by starstruckmon t3_1027geh in MachineLearning
omniron t1_j2stl7w wrote
Reply to comment by bloc97 in [R] Massive Language Models Can Be Accurately Pruned in One-Shot by starstruckmon
Just shows we have a huge amount to learn about how these systems actually work
mycall t1_j50h4l7 wrote
It probably is definitely complicated. There are many DAGs to reach similar or repeating patterns, or connections are suboptimal and thus never needed. How do you choose which to keep and which to delete.
Viewing a single comment thread. View all comments