Aymeric Roucher's picture

Aymeric Roucher

m-ric

·

http://aymeric-roucher.github.io

AI & ML interests

Leading Agents at Hugging Face 🤗

Recent Activity

upvoted an article about 1 hour ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

upvoted an article about 1 hour ago

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

liked a model about 2 hours ago

MoritzLaurer/ModernBERT-base-zeroshot-v2.0

View all activity

Articles

Introducing smolagents: simple agents that write actions in code.

Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge

Our Transformers Code Agent beats the GAIA benchmark!

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

License to Call: Introducing Transformers Agents 2.0

Open-source LLMs as LangChain Agents

Organizations

m-ric's activity

upvoted 2 articles about 1 hour ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By

•

4 days ago

• 31

Article

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

By

•

2 days ago

• 3

upvoted a paper 7 days ago

A New Approach for Explainable Multiple Organ Annotation with Few Data

Paper • 1912.12932 • Published Dec 30, 2019 • 1

upvoted an article 21 days ago

Article

🇪🇺✍️ EU AI Act: Systemic Risks in the First CoP Draft Comments ✍️🇪🇺

By

•

25 days ago

• 12

upvoted a collection 22 days ago

Diffusion Tools

4 items • Updated Apr 30, 2024 • 5

upvoted a collection 23 days ago

GUI agents

A collection of papers on GUI agents • 3 items • Updated 24 days ago • 5

upvoted a paper 23 days ago

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published 25 days ago • 26

upvoted 2 papers 27 days ago

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

Paper • 2401.00812 • Published Jan 1, 2024 • 4

Code Agents are State of the Art Software Testers

Paper • 2406.12952 • Published Jun 18, 2024 • 1

upvoted a collection 27 days ago

Awesome Computer Use Agents

https://github.com/ranpox/awesome-computer-use • 25 items • Updated 19 days ago • 7

upvoted a paper 27 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 57

upvoted 2 articles about 1 month ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

Dec 4, 2024

• 75

Article

They Said It Couldn’t Be Done

By

•

Dec 5, 2024

• 76

upvoted 2 papers about 1 month ago

Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11, 2024 • 29

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 77

upvoted an article about 1 month ago

Article

EuroLLM-9B

By

•

Dec 2, 2024

• 105

upvoted a paper about 1 month ago

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 20

upvoted 3 articles about 2 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

By

•

Nov 21, 2024

• 35

Article

Halo: Open Source Health Tracking with Wearables

By

•

Nov 19, 2024

• 99

Article

Decoding Strategies in Large Language Models

By

•

Oct 29, 2024

• 38