Aria: An Open Multimodal Native Mixture-of-Experts Model • Paper 2410.05993 • Published Oct 8, 2024
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation • Paper 2410.13848 • Published Oct 17, 2024
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration • Paper 2210.01029 • Published Oct 3, 2022