Julien BLANCHON PRO

blanchon

AI & ML interests

Math

Recent Activity

upvoted a paper about 5 hours ago

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

liked a model about 9 hours ago

zer0int/CLIP-GmP-ViT-L-14

liked a model 4 days ago

seastar105/whisper-tiny-emo-speech-caption

blanchon's activity

commented a paper 3 months ago

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Paper • 2410.12705 • Published Oct 16, 2024 • 30

New activity in blanchon/PixDiet 3 months ago

Potential benchmark

#1 opened 3 months ago by

New activity in cbensimon/zerogpu-quickstart 3 months ago

Create app.py

#7 opened 3 months ago by

New activity in enzostvs/lora-studio 7 months ago

add-animate-flip-everywhere

#14 opened 7 months ago by

New activity in codys12/MergeLlama 7 months ago

fix dataset

#1 opened over 1 year ago by

commented 15 papers 7 months ago

Guiding a Diffusion Model with a Bad Version of Itself

Paper • 2406.02507 • Published Jun 4, 2024 • 15

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Paper • 2406.02523 • Published Jun 4, 2024 • 10

V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation

Paper • 2406.02511 • Published Jun 4, 2024 • 9

I4VGen: Image as Stepping Stone for Text-to-Video Generation

Paper • 2406.02230 • Published Jun 4, 2024 • 16

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 18

To Believe or Not to Believe Your LLM

Paper • 2406.02543 • Published Jun 4, 2024 • 32

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Paper • 2406.02430 • Published Jun 4, 2024 • 31

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Paper • 2406.02886 • Published Jun 5, 2024 • 8

Item-Language Model for Conversational Recommendation

Paper • 2406.02844 • Published Jun 5, 2024 • 9

Searching Priors Makes Text-to-Video Synthesis Better

Paper • 2406.03215 • Published Jun 5, 2024 • 11

PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM

Paper • 2406.02884 • Published Jun 5, 2024 • 15

Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Paper • 2406.03344 • Published Jun 5, 2024 • 18

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

Paper • 2406.01014 • Published Jun 3, 2024 • 31

Parrot: Multilingual Visual Instruction Tuning

Paper • 2406.02539 • Published Jun 4, 2024 • 35

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4, 2024 • 37