Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 22 days ago • 120
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 106
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images Paper • 2406.13735 • Published Jun 19, 2024 • 5
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing Paper • 2406.10601 • Published Jun 15, 2024 • 66