mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
•
3.94M
•
•
4.26k
Pruned experts from Mixtral-8x7B-Instruct-v0.1 with respect to the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs"