Finally, a Replacement for BERT: Introducing ModernBERT
On entailment-adjacent tasks (which, btw, great work on the zero-shot NLI models @MoritzLaurer!), I'd expect DeBERTa to be slightly better than ModernBERT -- its pretraining objective seems better aligned with those tasks. In our evals, DeBERTa consistently came out on top on MNLI (there's a full GLUE table in the appendix of the paper); it's only on aggregated GLUE that ModernBERT-Base beat DeBERTaV3-Base.
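For readers unfamiliar with the zero-shot NLI setup being referenced, here's a minimal sketch using the Hugging Face `transformers` zero-shot-classification pipeline. The checkpoint name is one of MoritzLaurer's public NLI models; any NLI-finetuned encoder could be swapped in, and the example sentence and labels are just illustrative:

```python
# Minimal sketch: zero-shot classification via an NLI model.
# The pipeline poses each candidate label as a hypothesis
# ("This example is about {label}.") and scores entailment
# between the input text (premise) and that hypothesis.
from transformers import pipeline

# One of MoritzLaurer's public NLI checkpoints; any NLI-finetuned
# encoder could be substituted here.
classifier = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli",
)

result = classifier(
    "ModernBERT supports sequence lengths of up to 8192 tokens.",
    candidate_labels=["machine learning", "cooking", "politics"],
)

# Labels come back sorted by entailment probability, highest first.
print(result["labels"][0], round(result["scores"][0], 3))
```

This is why pretraining/finetuning alignment with entailment matters here: the zero-shot classifier is literally running an NLI head under the hood, so a model that is stronger on MNLI tends to be stronger at this kind of zero-shot classification too.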