Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
11
20
Shengyi Costa Huang
vwxyzjn
Follow
muellerzr's profile picture
thomwolf's profile picture
eagle0504's profile picture
56 followers
·
4 following
http://costa.sh
vwxyzjn
vwxyzjn
AI & ML interests
None yet
Recent Activity
new
activity
about 20 hours ago
allenai/OLMo-2-1124-13B-SFT:
Very, very confused
updated
a model
about 20 hours ago
allenai/OLMo-2-1124-13B-Instruct-RLVR2
updated
a model
about 20 hours ago
allenai/OLMo-2-1124-13B-Instruct-RLVR1
View all activity
Articles
How NuminaMath Won the 1st AIMO Progress Prize
Jul 11, 2024
•
110
Preference Optimization for Vision Language Models
Jul 10, 2024
•
54
Putting RL back in RLHF
Jun 12, 2024
•
66
Constitutional AI with Open LLMs
Feb 1, 2024
•
13
The N Implementation Details of RLHF with PPO
Oct 24, 2023
•
24
Organizations
vwxyzjn
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
allenai/OLMo-2-1124-13B-SFT
about 20 hours ago
Very, very confused
2
#1 opened 3 days ago by
mradermacher
New activity in
lvwerra/starcoderbase-gsm8k
over 1 year ago
Create README.md
#2 opened over 1 year ago by
vwxyzjn
New activity in
vwxyzjn/starcoderbase-triviaqa
over 1 year ago
Adding `safetensors` variant of this model
#1 opened over 1 year ago by
SFconvertbot
New activity in
vwxyzjn/lm-human-preferences
over 1 year ago
About bookCorpus and tldr/train-subset.json
1
#1 opened over 1 year ago by
shuyuthriving
About bookCorpus and tldr/train-subset.json
1
#1 opened over 1 year ago by
shuyuthriving