LeMaterial: an open source initiative to accelerate materials discovery and research 28 days ago • 32
Article 🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 4 days ago • 30
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14, 2024 • 543
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 57
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 18 days ago • 75
Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien • 14 days ago • 11
TabuLa-8B Collection Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 • 4 items • Updated Jun 19, 2024 • 11
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper • 2412.04814 • Published Dec 6, 2024 • 45
Solving Quantitative Reasoning Problems with Language Models Paper • 2206.14858 • Published Jun 29, 2022 • 1
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper • 2412.09605 • Published 25 days ago • 26
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published Nov 26, 2024 • 77
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations Paper • 2411.00640 • Published Nov 1, 2024 • 3
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays Paper • 2410.21969 • Published Oct 29, 2024 • 9
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4, 2024 • 24
Article Breaking Barriers: The Critical Role of Art and Design in Advancing AI Capabilities By fffiloni • Jan 15, 2024 • 3
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14, 2024 • 15