Julien BLANCHON's picture

Julien BLANCHON PRO

blanchon

·

AI & ML interests

Math

Recent Activity

upvoted a paper about 4 hours ago

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

liked a model about 9 hours ago

zer0int/CLIP-GmP-ViT-L-14

liked a model 4 days ago

seastar105/whisper-tiny-emo-speech-caption

View all activity

Organizations

blanchon's activity

upvoted a paper about 4 hours ago

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

Paper • 2407.15886 • Published Jul 21, 2024 • 2

upvoted a paper 4 days ago

In-Context LoRA for Diffusion Transformers

Paper • 2410.23775 • Published Oct 31, 2024 • 11

upvoted a paper 7 days ago

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Paper • 2408.09702 • Published Aug 19, 2024 • 10

upvoted a collection 11 days ago

DeepSeek-V3

3 items • Updated about 21 hours ago • 98

upvoted an article 19 days ago

Article

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

By

•

Jul 10, 2024

• 44

upvoted 2 collections 20 days ago

OneDiffusion

Collection of different version of OneDiffusion models • 8 items • Updated 8 days ago • 2

Bamba

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 19 days ago • 18

upvoted 2 collections about 1 month ago

WavTokenizer-Medium-Large

https://arxiv.org/abs/2408.16532 • 5 items • Updated Oct 23, 2024 • 6

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 186

upvoted 2 collections about 2 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated about 3 hours ago • 32

Cephalo

Cephalo is a series of multimodal vision large language models (V-LLMs) designed to integrate visual and linguistic reasoning in materials science. • 15 items • Updated Oct 25, 2024 • 4

upvoted 2 papers about 2 months ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 71

RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

Paper • 2305.15685 • Published May 25, 2023 • 4

upvoted a collection about 2 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 12 items • Updated 1 day ago • 29

upvoted a collection 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 15 days ago • 197

upvoted 3 papers 2 months ago

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published Sep 17, 2024 • 29

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 77

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Paper • 2410.06940 • Published Oct 9, 2024 • 6

upvoted 2 collections 2 months ago

CC Dataset

6 items • Updated Nov 1, 2024 • 1

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101