Tom Aarsen's picture

Tom Aarsen

tomaarsen

·

https://linkedin.com/in/tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Recent Activity

upvoted a paper about 1 hour ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

upvoted a collection about 1 hour ago

liked a model about 1 hour ago

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5

View all activity

Articles

Finally, a Replacement for BERT: Introducing ModernBERT

Welcome Gemma 2 - Google's new open LLM

Training and Finetuning Embedding Models with Sentence Transformers v3

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

🪆 Introduction to Matryoshka Embedding Models

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

🕳️ Attention Sinks in LLMs for endless fluency

Organizations

tomaarsen's activity

upvoted a paper about 1 hour ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published 5 days ago • 1

upvoted a collection about 1 hour ago

KaLM-embedding

5 items • Updated 4 days ago • 7

upvoted a collection 2 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated about 3 hours ago • 69

upvoted a collection 3 days ago

PubMedBERT Embeddings M2V

Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants. • 5 items • Updated 3 days ago • 3

upvoted a collection 4 days ago

ModernGLiNER

GLiNER models based on modern encoder architectures • 2 items • Updated 13 days ago • 6

upvoted an article 7 days ago

Article

Fine-tune ModernBERT for text classification using synthetic data

By

•

7 days ago

• 19

upvoted a collection 10 days ago

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 19 days ago • 47

upvoted 2 papers 17 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 18 days ago • 337

upvoted an article 17 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 127

upvoted a paper 18 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 20 days ago • 119

upvoted a collection 18 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 18 days ago • 116

upvoted 4 articles about 1 month ago

Article

Building a Local Vector Database Index with Annoy and Sentence Transformers

By

•

Dec 5, 2024

• 3

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

Dec 4, 2024

• 75

Article

Accelerating Embedding & Reranking Models on AMD Using Infinity

By

•

Dec 3, 2024

• 4

Article

EuroLLM-9B

By

•

Dec 2, 2024

• 105

upvoted a paper about 1 month ago

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 20

upvoted a collection about 2 months ago

Models for dataset curation

9 items • Updated Dec 5, 2024 • 17

upvoted an article about 2 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

By

•

Nov 21, 2024

• 35

upvoted a paper about 2 months ago

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18, 2024 • 17