kaizuberbuehler
's Collections
Foundation Models
updated
OLMo: Accelerating the Science of Language Models
Paper
•
2402.00838
•
Published
•
82
Gemini 1.5: Unlocking multimodal understanding across millions of tokens
of context
Paper
•
2403.05530
•
Published
•
61
StarCoder: may the source be with you!
Paper
•
2305.06161
•
Published
•
29
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
56
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language
Models
Paper
•
2404.12387
•
Published
•
38
RecurrentGemma: Moving Past Transformers for Efficient Open Language
Models
Paper
•
2404.07839
•
Published
•
43
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Paper
•
2404.07413
•
Published
•
36
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model
Handling Resolutions from 336 Pixels to 4K HD
Paper
•
2404.06512
•
Published
•
30
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
Paper
•
2404.05892
•
Published
•
33
MiniCPM: Unveiling the Potential of Small Language Models with Scalable
Training Strategies
Paper
•
2404.06395
•
Published
•
22
YaART: Yet Another ART Rendering Technology
Paper
•
2404.05666
•
Published
•
16
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with
Interleaved Visual-Textual Tokens
Paper
•
2404.03413
•
Published
•
25
Advancing LLM Reasoning Generalists with Preference Trees
Paper
•
2404.02078
•
Published
•
44
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
Phone
Paper
•
2404.14219
•
Published
•
254
CogVLM: Visual Expert for Pretrained Language Models
Paper
•
2311.03079
•
Published
•
23
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
•
2404.14619
•
Published
•
126
Pegasus-v1 Technical Report
Paper
•
2404.14687
•
Published
•
30
Jamba: A Hybrid Transformer-Mamba Language Model
Paper
•
2403.19887
•
Published
•
104
Tele-FLM Technical Report
Paper
•
2404.16645
•
Published
•
17
What matters when building vision-language models?
Paper
•
2405.02246
•
Published
•
101
Imp: Highly Capable Large Multimodal Models for Mobile Devices
Paper
•
2405.12107
•
Published
•
26
Paper
•
2406.09414
•
Published
•
95
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
•
2406.09246
•
Published
•
36
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual
Visual Text Rendering
Paper
•
2406.10208
•
Published
•
22
GEB-1.3B: Open Lightweight Large Language Model
Paper
•
2406.09900
•
Published
•
21
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
•
2406.11931
•
Published
•
58
The Llama 3 Herd of Models
Paper
•
2407.21783
•
Published
•
110
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and
Illumination Disentanglement
Paper
•
2408.00653
•
Published
•
29
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
76
Paper
•
2408.07009
•
Published
•
61
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Paper
•
2408.12570
•
Published
•
31
OLMoE: Open Mixture-of-Experts Language Models
Paper
•
2409.02060
•
Published
•
78
Paper
•
2409.00587
•
Published
•
32
Qwen2.5-Coder Technical Report
Paper
•
2409.12186
•
Published
•
139
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at
Any Resolution
Paper
•
2409.12191
•
Published
•
76
NVLM: Open Frontier-Class Multimodal LLMs
Paper
•
2409.11402
•
Published
•
73
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art
Multimodal Models
Paper
•
2409.17146
•
Published
•
106
Making Text Embedders Few-Shot Learners
Paper
•
2409.15700
•
Published
•
30
EuroLLM: Multilingual Language Models for Europe
Paper
•
2409.16235
•
Published
•
26
stabilityai/stable-diffusion-3.5-large
Text-to-Image
•
Updated
•
129k
•
•
1.8k
Paper
•
2412.16720
•
Published
•
29
NVILA: Efficient Frontier Visual Language Models
Paper
•
2412.04468
•
Published
•
57
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
•
2412.03555
•
Published
•
121
Open-Sora Plan: Open-Source Large Video Generation Model
Paper
•
2412.00131
•
Published
•
33
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
•
2411.15124
•
Published
•
58