πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated 15 days ago β’ 208
view article Article β΄οΈ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang β’ 3 days ago β’ 9
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper β’ 2501.01257 β’ Published 4 days ago β’ 41
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 β’ Nov 21, 2024 β’ 35
view post Post 1075 π€π Introducing Observers: A Lightweight SDK for AI Observability ππ€Observers is an open-source Python SDK that provides comprehensive observability for AI applications. Our library makes it easy to:- Track and record interactions with AI models- Store observations in multiple backends- Query and analyse your AI interactions with easehttps://huggingface.co/blog/davidberenstein1957/observers-a-lightweight-sdk-for-ai-observability π₯ 5 5 π 5 5 π 2 2 β€οΈ 2 2 π€ 2 2 + Reply
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 4 days ago β’ 31
PowerInfer/SmallThinker-3B-Preview Text Generation β’ Updated about 13 hours ago β’ 6.37k β’ β’ 264