1 22 127

peng

superpeng

AI & ML interests

None yet

Recent Activity

liked a dataset 14 days ago

Krystalan/xmediasum

upvoted a paper 14 days ago

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

upvoted an article about 1 month ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

View all activity

Organizations

None yet

superpeng's activity

liked a dataset 14 days ago

Krystalan/xmediasum

Viewer • Updated Feb 15, 2023 • 40k • 39 • 2

upvoted a paper 14 days ago

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published 16 days ago • 21

upvoted an article about 1 month ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 75

liked 2 datasets about 1 month ago

O1-OPEN/OpenO1-SFT

Viewer • Updated 22 days ago • 77.7k • 2.15k • 313

medalpaca/medical_meadow_wikidoc

Viewer • Updated Apr 6, 2023 • 10k • 1.08k • 43

liked a dataset 2 months ago

kaiokendev/SuperCOT-dataset

Viewer • Updated May 26, 2023 • 58.3k • 46 • 46

liked a model 2 months ago

kaiokendev/SuperCOT-LoRA

Updated May 6, 2023 • 104

liked a dataset 2 months ago

RLHFlow/prompt-collection-v0.1

Viewer • Updated May 8, 2024 • 179k • 36 • 8

liked a Space 2 months ago

Running

309

📐

Reward Bench Leaderboard

upvoted a collection 2 months ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 12

liked a dataset 2 months ago

tasksource/oasst1_pairwise_rlhf_reward

Viewer • Updated Jul 4, 2023 • 18.9k • 87 • 42

liked a dataset 3 months ago

BAAI/AquilaMed-RL

Viewer • Updated Jun 21, 2024 • 12.7k • 40 • 8

updated a collection 3 months ago

LLM Pretrain

Collection

6 items • Updated Oct 8, 2024

liked a dataset 3 months ago

fka/awesome-chatgpt-prompts

Viewer • Updated 2 days ago • 203 • 5.63k • 6.74k

liked 2 datasets 4 months ago

hkust-nlp/deita-10k-v0

Viewer • Updated Dec 31, 2023 • 10k • 109 • 30

survivi/Llama-3-SynE-Dataset

Viewer • Updated 22 days ago • 168M • 976 • 9

liked a model 4 months ago

abacusai/Smaug-Qwen2-72B-Instruct

Text Generation • Updated Aug 6, 2024 • 2.62k • 9

liked 3 datasets 5 months ago