Self-Training Enables Video Instruction Tuning with Any Supervision
Orr Zohar PRO
orrzohar
AI & ML interests
Large Multi-Modal Models, Foundation Models, Video Understanding
Recent Activity
upvoted
a
paper
about 6 hours ago
BoxingGym: Benchmarking Progress in Automated Experimental Design and
Model Discovery
upvoted
a
paper
about 6 hours ago
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning
for Image and Video Generation
upvoted
a
paper
about 6 hours ago
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction