GAIA release Collection Gather the items of the GAIA release β’ 4 items β’ Updated Nov 23, 2023 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 4 days ago β’ 30
SwiftKV Models Collection SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. β’ 3 items β’ Updated Dec 5, 2024 β’ 3
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 7 days ago β’ 19
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper β’ 2412.18619 β’ Published 22 days ago β’ 49
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper β’ 2412.18319 β’ Published 13 days ago β’ 34
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29, 2024 β’ 260
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design Paper β’ 2412.14590 β’ Published 19 days ago β’ 13
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper β’ 2412.12094 β’ Published 21 days ago β’ 10
Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents Paper β’ 2412.13194 β’ Published 20 days ago β’ 12
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture Paper β’ 2412.11834 β’ Published 21 days ago β’ 6