Chong Ruan
Chester111
AI & ML interests
AGI & LLM
Recent Activity
updated a collection about 21 hours ago
DeepSeek-V3
authored a paper 20 days ago
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
updated a collection 20 days ago
DeepSeek-VL2
Chester111's activity
236B?
1
#9 opened 24 days ago by erichartford
Adds `transformers` as a library
1
#1 opened 27 days ago by reach-vb
Update README.md
#5 opened 3 months ago by xianbao
Update metadata
#9 opened 4 months ago by xianbao
Add base_model metadata
#8 opened 5 months ago by davanstrien
Add base_model metadata
#2 opened 5 months ago by davanstrien
Add base_model metadata
#3 opened 5 months ago by davanstrien
Trained on Code Search Net
1
#5 opened about 1 year ago by admarcosai
Thank you for making my dream come true ❤️
1
#10 opened about 1 year ago by rombodawg
Deepseek-Coder at models leaderboard
2
#9 opened about 1 year ago by bitsnaps
tokenizer.model
6
#6 opened about 1 year ago by nds90
prompt format?
1
#8 opened about 1 year ago by obtion
Confirming the EOS token? 32021 or 32014? Or both?
4
#1 opened about 1 year ago by TheBloke
Enhancement Request: Model Sharding for DeepSeek-Coder-6.7b-Instruct
2
#4 opened about 1 year ago by Firejowl
AMAZING WORK
1
#7 opened about 1 year ago by gmacgmac
Could you upload tokenizer.model for this and other models?
3
#4 opened about 1 year ago by RonanMcGovern
quantized versions?
1
#2 opened about 1 year ago by cameronbergh
LICENSE file is 0 bytes
1
#3 opened about 1 year ago by TheBloke
DeepSeek Coder is not based on Llama 2
8
#2 opened about 1 year ago by Chester111