Biomedical Collection Models for biomedical research applications, such as radiology report generation and biomedical language understanding. • 9 items • Updated Nov 1, 2024 • 6
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Paper • 2408.00653 • Published Aug 1, 2024 • 29
SpeechT5 Collection The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated Jul 11, 2024 • 23
Multimodal Models Collection Multimodal models with leading performance. • 14 items • Updated Nov 17, 2024 • 20
MelodyFlow Collection MelodyFlow: High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching • 7 items • Updated Oct 23, 2024 • 16
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 46
Fairseq S^2 TTS Collection Text-to-speech models from fairseq s^2 • 11 items • Updated Jan 16, 2024 • 6
Reward Bench Collection Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated 1 day ago • 9
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Paper • 2312.06640 • Published Dec 11, 2023 • 46
Llama 3.2 3B & 1B GGUF Quants Collection Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated Sep 26, 2024 • 46
Japanese Stable LM Collection Suite of LLMs focusing on Japanese usage • 15 items • Updated May 7, 2024 • 18