LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 11 days ago • 48
LLaMA-Omni: Seamless Speech Interaction with Large Language Models Paper • 2409.06666 • Published Sep 10, 2024 • 56
TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space Paper • 2402.17811 • Published Feb 27, 2024 • 1
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models Paper • 2306.10968 • Published Jun 19, 2023 • 7