nlpguy
AI & ML interests
large language models
Recent Activity
new activity
19 days ago
matteogeniaccio/phi-4: 🚩 Report: Legal issue(s)
new activity
22 days ago
matteogeniaccio/phi-4: Notably better than Phi3.5 in many ways, but something is wrong.
Organizations
None yet
nlpguy's activity
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened 12 days ago
by
phil111
🚩 Report: Legal issue(s)
6
#9 opened 19 days ago
by
nightflightdk
Notably better than Phi3.5 in many ways, but something is wrong.
8
#5 opened 22 days ago
by
phil111
How do you quantize that so quickly?
1
#1 opened 2 months ago
by
nlpguy
Love the Idea, one tiny request.
#2 opened 2 months ago
by
nlpguy
Would you be willing to share the mergekit config?
1
#4 opened 4 months ago
by
nlpguy
Adding Evaluation Results
#1 opened 4 months ago
by
leaderboard-pr-bot
Was this dataset created with Claude Sonnet 3 or 3.5?
2
#2 opened 4 months ago
by
nlpguy
leaderboard should be more curated
7
#908 opened 4 months ago
by
ehartford
Licence issue
2
#55 opened 4 months ago
by
Ayaz550
Model Failed: StableProse
3
#894 opened 5 months ago
by
nlpguy
would you consider publishing the intermediate models from step 1 and 2
2
#1 opened 5 months ago
by
nlpguy
Voting System: You can vote for your own model.
3
#851 opened 6 months ago
by
nlpguy
Submitted models aren't showing up
4
#835 opened 6 months ago
by
Stark2008
Model not on pending for evaluation
3
#841 opened 6 months ago
by
acbdkk
OpenHermes Dataset Cleaning
1
#17 opened 7 months ago
by
ashologn
Wrong results or am I understanding something wrong?
8
#839 opened 6 months ago
by
nicobuko
Leaderboard isn't updating its model list.
3
#809 opened 6 months ago
by
nlpguy
Archive of the last leaderboard
5
#807 opened 6 months ago
by
MarxistLeninist
Models disappearing from eval queue?
7
#805 opened 6 months ago
by
ArkaAbacus