Clem 🤗's picture

Clem 🤗 PRO

clem

·

http://huggingface.co

AI & ML interests

multi-modal, time-series, biology and chemistry

Recent Activity

liked a dataset about 13 hours ago

ylecun/mnist

replied to their post about 13 hours ago

Cool to see @ylecun joining the top 10 of most followed on HF! (and leaderboard by @mvaloatto is here: https://huggingface.co/spaces/mvaloatto/TCTF)

reacted to cfahlgren1's post with 🚀 1 day ago

The https://huggingface.co/deepseek-ai/DeepSeek-V3 is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page. You can play with it here: https://deepseek-artifacts.vercel.app All the responses get saved in the https://huggingface.co/datasets/cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest quality frontend datasets on the hub 💪

View all activity

Organizations

clem's activity

upvoted a paper 16 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 18 days ago • 116

upvoted a collection 16 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 16 days ago • 112

upvoted a paper 17 days ago

The Open Source Advantage in Large Language Models (LLMs)

Paper • 2412.12004 • Published 19 days ago • 9

upvoted 17 papers 19 days ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published 23 days ago • 35

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 23 days ago • 87

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 22 days ago • 136

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published 24 days ago • 52

Phi-4 Technical Report

Paper • 2412.08905 • Published 24 days ago • 95

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 23 days ago • 92

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published 24 days ago • 38

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published 24 days ago • 45

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published 25 days ago • 50

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 25 days ago • 46

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 29 days ago • 47

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published 25 days ago • 70

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 26 days ago • 66

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 26 days ago • 72

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 26 days ago • 71

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 29 days ago • 46

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 30 days ago • 49