nlpguy's picture

nlpguy

nlpguy

·

AI & ML interests

large language models

Recent Activity

new activity 12 days ago

deepseek-ai/DeepSeek-V3-Base:Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

new activity 19 days ago

matteogeniaccio/phi-4:🚩 Report: Legal issue(s)

new activity 22 days ago

matteogeniaccio/phi-4:Notably better than Phi3.5 in many ways, but something is wrong.

View all activity

Organizations

None yet

nlpguy's activity

New activity in deepseek-ai/DeepSeek-V3-Base 12 days ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened 12 days ago by

New activity in matteogeniaccio/phi-4 19 days ago

🚩 Report: Legal issue(s)

#9 opened 19 days ago by

New activity in matteogeniaccio/phi-4 22 days ago

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened 22 days ago by

New activity in mradermacher/smolchess-v2-GGUF 2 months ago

How do you quantitize that so quickly?

#1 opened 2 months ago by

New activity in mlabonne/chessllm 2 months ago

Love the Idea, one tiny request.

#2 opened 2 months ago by

New activity in Luni/StarDust-12b-v2 4 months ago

Would you be willing to share the mergekit config?

#4 opened 4 months ago by

New activity in nlpguy/StableProse 4 months ago

Adding Evaluation Results

#1 opened 4 months ago by

leaderboard-pr-bot

New activity in PocketDoc/Dans-MemoryCore-CoreCurriculum-Small 4 months ago

Was this dataset created with Claude Sonnet 3 or 3.5?

#2 opened 4 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 4 months ago

leaderboard should be more curated

#908 opened 4 months ago by

New activity in black-forest-labs/FLUX.1-schnell 4 months ago

Licence issue

#55 opened 4 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 4 months ago

Model Failed: StableProse

#894 opened 5 months ago by

New activity in v000000/MN-12B-Estrella-v1 5 months ago

would you consider publishing the intermediate models from step 1 and 2

#1 opened 5 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 6 months ago

Voting System: You can vote for your own model.

#851 opened 6 months ago by

Submitted models aren't showing up

#835 opened 6 months ago by

Model not on pending for evaluation

#841 opened 6 months ago by

New activity in NousResearch/Hermes-2-Pro-Llama-3-8B 6 months ago

OpenHermes Dataset Cleaning

#17 opened 7 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 6 months ago

Wrong results or am i understanding something wrong?

#839 opened 6 months ago by

Leaderboard isn't updating its model list.

#809 opened 6 months ago by

Archive of the last leaderboard

#807 opened 6 months ago by

MarxistLeninist

Models disappearing from eval queue?

#805 opened 6 months ago by