Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 • 70
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published 5 days ago • 1
OLMo 2 Collection Artifacts for the second set of OLMo models. • 22 items • Updated about 3 hours ago • 69
PubMedBERT Embeddings M2V Collection Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants. • 5 items • Updated 3 days ago • 3
ModernGLiNER Collection GLiNER models based on modern encoder architectures • 2 items • Updated 13 days ago • 6
Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 7 days ago • 19
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM, licensed under Apache 2.0. • 8 items • Updated 19 days ago • 47
Spectrum: Targeted Training on Signal to Noise Ratio Paper • 2406.06623 • Published Jun 7, 2024 • 12
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 127
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 20 days ago • 119
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 18 days ago • 116
Article Building a Local Vector Database Index with Annoy and Sentence Transformers By theeseus-ai • Dec 5, 2024 • 3
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 75
Article Accelerating Embedding & Reranking Models on AMD Using Infinity By michaelfeil • Dec 3, 2024 • 4
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 20
Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • Nov 21, 2024 • 35
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published Nov 18, 2024 • 17