Apolinário from multimodal AI art's picture

Apolinário from multimodal AI art PRO

multimodalart

·

https://multimodal.art

AI & ML interests

None yet

Recent Activity

updated a Space 3 days ago

multimodalart/flux-fill-outpaint

liked a model 8 days ago

fudan-generative-ai/hallo3

updated a Space 8 days ago

multimodalart/flux-lora-the-explorer

View all activity

Articles

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

🧨 Diffusers welcomes Stable Diffusion 3

LoRA training scripts of the world, unite!

SDXL in 4 steps with Latent Consistency LoRAs

Running IF with 🧨 diffusers on a Free Tier Google Colab

Train your ControlNet with diffusers

Organizations

multimodalart's activity

upvoted a collection 26 days ago

[MASK] is All You Need

Code, dataset, and pretrained model • 5 items • Updated Nov 29, 2024 • 8

upvoted a collection about 1 month ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 22 days ago • 122

upvoted a paper about 1 month ago

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 25

upvoted a collection 2 months ago

Stable Diffusion 3.5

6 items • Updated Oct 29, 2024 • 118

upvoted 2 papers 3 months ago

Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations

Paper • 2410.10792 • Published Oct 14, 2024 • 29

Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Paper • 2410.02416 • Published Oct 3, 2024 • 26

upvoted 2 collections 3 months ago

Loradex Highlights

This collection features awesome opensource LoRAs trained by members of the Glif Community during Loradex Early Access! • 14 items • Updated Oct 18, 2024 • 19

Emu3

Emu3: Next-Token Prediction is All You Need • 5 items • Updated 14 days ago • 67

upvoted 2 articles 4 months ago

Article

Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face

By

•

Sep 6, 2024

• 16

Article

Enhancing Image Model Dreambooth Training Through Effective Captioning: Key Observations

By

•

Jun 19, 2024

• 17

upvoted a collection 4 months ago

CogVideo

10 items • Updated Nov 27, 2024 • 45

upvoted an article 4 months ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

By

•

Aug 26, 2024

• 37

upvoted 5 papers 5 months ago

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 61

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12, 2024 • 37

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 117

IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts

Paper • 2408.03209 • Published Aug 6, 2024 • 21

Discrete Flow Matching

Paper • 2407.15595 • Published Jul 22, 2024 • 13

upvoted 3 papers 6 months ago

Scaling Diffusion Transformers to 16 Billion Parameters

Paper • 2407.11633 • Published Jul 16, 2024 • 25

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Paper • 2407.06723 • Published Jul 9, 2024 • 11