Yikang Shen PRO
YikangS
AI & ML interests
None yet
Organizations
YikangS's activity
When can we have the training code as illustrated in the paper.
12
#5 opened 9 months ago
by
Shamane
why not include Qwen1.5-MoE-A2.7B in the table?
1
#4 opened 9 months ago
by
J22
Dataset?
3
#1 opened 9 months ago
by
0xbitches
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot