Mike Young PRO

mikelabs

AI & ML interests

None yet

Recent Activity

published an article about 1 month ago

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

published an article about 1 month ago

DeMo: Decoupled Momentum Optimization

reacted to their post with 😎 about 1 month ago

this paper is like when you tell an art student just draw it in your own style and they actually do it perfectly on the first try 🎨 diffusion models getting too powerful fr https://www.aimodels.fyi/papers/arxiv/diffusion-self-distillation-zero-shot-customized-image

Articles

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

DeMo: Decoupled Momentum Optimization

Reverse Thinking Makes LLMs Stronger Reasoners

SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning

AIGS: Generating Science from AI-Powered Automated Falsification

SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text

Medical Video Generation for Disease Progression Simulation

Conversational Medical AI: Ready for Practice

Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering

Brain-Inspired Efficient Pruning: Exploiting Criticality in Spiking Neural Networks

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

An Internet Voting System Fatally Flawed in Creative New Ways

SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World

Robust ASR Error Correction with Conservative Data Filtering

Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions

Generative Agent Simulations of 1,000 People

That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Organizations

mikelabs's activity

commented on 4 papers about 1 month ago

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 33

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 33

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 25

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Paper • 2411.18478 • Published Nov 27, 2024 • 33

commented on 7 papers about 2 months ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 71

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 17

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 30

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 52

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 15

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 75

New activity in blog-explorers/README about 2 months ago

[Support] Community Articles

#5 opened 10 months ago by

commented on 8 papers about 2 months ago

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 34

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 112

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 43

Direct Preference Optimization Using Sparse Feature-Level Constraints

Paper • 2411.07618 • Published Nov 12, 2024 • 15

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Paper • 2411.08380 • Published Nov 13, 2024 • 25

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published Nov 9, 2024 • 19

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published Nov 11, 2024 • 28