Mike Young PRO
mikelabs
AI & ML interests
None yet
Recent Activity
published
an
article
about 1 month ago
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
published
an
article
about 1 month ago
DeMo: Decoupled Momentum Optimization
reacted
to
their
post
with 😎
about 1 month ago
this paper is like when you tell an art student just draw it in your own style and they actually do it perfectly on the first try 🎨 diffusion models getting too powerful fr
https://www.aimodels.fyi/papers/arxiv/diffusion-self-distillation-zero-shot-customized-image
Articles
Organizations
mikelabs's activity
Video Depth without Video Models
Paper
•
2411.19189
•
Published
•
33
•
7
Video Depth without Video Models
Paper
•
2411.19189
•
Published
•
33
•
7
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper
•
2411.19930
•
Published
•
25
•
3
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Paper
•
2411.18478
•
Published
•
33
•
14
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper
•
2411.10442
•
Published
•
71
•
4
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Paper
•
2411.13281
•
Published
•
17
•
5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Paper
•
2411.13503
•
Published
•
30
•
3
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Paper
•
2411.10958
•
Published
•
52
•
9
Continuous Speculative Decoding for Autoregressive Image Generation
Paper
•
2411.11925
•
Published
•
15
•
3
RedPajama: an Open Dataset for Training Large Language Models
Paper
•
2411.12372
•
Published
•
48
•
3
Generative World Explorer
Paper
•
2411.11844
•
Published
•
75
•
6
[Support] Community Articles
70
#5 opened 10 months ago
by
victor
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Paper
•
2411.06558
•
Published
•
34
•
6
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper
•
2411.10440
•
Published
•
112
•
7
Cut Your Losses in Large-Vocabulary Language Models
Paper
•
2411.09009
•
Published
•
43
•
4
Direct Preference Optimization Using Sparse Feature-Level Constraints
Paper
•
2411.07618
•
Published
•
15
•
3
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Paper
•
2411.08380
•
Published
•
25
•
3
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
•
2411.08147
•
Published
•
63
•
4
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Paper
•
2411.06208
•
Published
•
19
•
6
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Paper
•
2411.07126
•
Published
•
28
•
5