Yuseung "Phillip" Lee

phillipinseoul

https://phillipinseoul.github.io/

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper about 2 hours ago

AutoPresent: Designing Structured Visuals from Scratch

upvoted a paper about 2 hours ago

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

upvoted a paper 1 day ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Organizations

phillipinseoul's activity

upvoted 2 papers about 2 hours ago

AutoPresent: Designing Structured Visuals from Scratch

Paper • 2501.00912 • Published 6 days ago • 7

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

Paper • 2501.03059 • Published 1 day ago • 12

upvoted 3 papers 1 day ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 7 days ago • 39

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published 8 days ago • 15

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 4 days ago • 22

upvoted 2 papers 2 days ago

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published 5 days ago • 33

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published 8 days ago • 36

upvoted 2 papers 5 days ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published 8 days ago • 22

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published 6 days ago • 23

liked a dataset 5 days ago

ccvl/3DSRBench

Viewer • Updated 5 days ago • 5.16k • 24 • 4

upvoted 2 papers 7 days ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published 9 days ago • 12

PERSE: Personalized 3D Generative Avatars from A Single Portrait

Paper • 2412.21206 • Published 8 days ago • 15

liked 2 models 8 days ago

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 1.89k • 30

lmms-lab/llava-onevision-qwen2-7b-ov-chat

Text Generation • Updated Oct 23, 2024 • 2.41k • 18

liked a model 9 days ago

nyu-visionx/cambrian-13b

Text Generation • Updated Jun 28, 2024 • 71 • 19

upvoted a paper 12 days ago

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

Paper • 2412.18072 • Published 15 days ago • 14

liked a model 12 days ago

nyu-visionx/cambrian-8b

Text Generation • Updated Jun 28, 2024 • 1.98k • 61

upvoted 2 papers 12 days ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published 19 days ago • 16

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 15 days ago • 34

liked a Space 13 days ago

Anychat

Space • Running on CPU Upgrade • 1.22k