Sam Paech's picture

Sam Paech PRO

sam-paech

·

https://eqbench.com

AI & ML interests

Emotional intelligence, alignment, benchmarking

Recent Activity

updated a dataset about 22 hours ago

sam-paech/BuzzBench-v0.60

liked a model about 1 month ago

SiliconThaumaturgy/Darkest-muse-v1-rk3588-1.1.2

new activity about 2 months ago

sam-paech/Darkest-muse-v1:Love this model but I wish the context was higher

View all activity

Articles

MMLU-Pro-NoMath

Organizations

sam-paech's activity

upvoted a paper 4 months ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 64

upvoted an article 6 months ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

By

•

Jul 27, 2024

• 28