Shawon Ashraf's picture

19 268

Shawon Ashraf

shawon

·

https://www.shawonashraf.com/

AI & ML interests

Multi-Modal NLP, LLM and RAG

Recent Activity

liked a dataset 14 days ago

HuggingFaceTB/finemath

liked a model 22 days ago

JeffreyXiang/TRELLIS-image-large

reacted to MohamedRashad's post with 🔥 22 days ago

For those Game Developers out there who wants a tool to generate them 3d assets of different game items. I built something for you 😅 https://huggingface.co/JeffreyXiang/TRELLIS-image-large + https://huggingface.co/Qwen/Qwen2.5-72B-Instruct + https://huggingface.co/Freepik/flux.1-lite-8B-alpha = https://huggingface.co/spaces/MohamedRashad/Game-Items-Generator Happy building 🎉

View all activity

Organizations

shawon's activity

upvoted a collection about 1 month ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 102

upvoted a paper 2 months ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted a collection 2 months ago

LongVU

7 items • Updated Oct 31, 2024 • 28

upvoted a paper 3 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

upvoted a collection 3 months ago

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 32 items • Updated 2 days ago • 6

upvoted 5 papers 3 months ago

FreeInit: Bridging Initialization Gap in Video Diffusion Models

Paper • 2312.07537 • Published Dec 12, 2023 • 25

Image Copy Detection for Diffusion Models

Paper • 2409.19952 • Published Sep 30, 2024 • 13

Visual Question Decomposition on Multimodal Large Language Models

Paper • 2409.19339 • Published Sep 28, 2024 • 8

UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models

Paper • 2409.20551 • Published Sep 30, 2024 • 14

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 48

upvoted 2 articles 3 months ago

Article

Data is better together

Mar 4, 2024

• 8

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 180

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 551

upvoted a collection 4 months ago

Flow-Judge-v0.1

Flow-Judge-v0.1 models • 5 items • Updated Sep 17, 2024 • 19

upvoted a collection 6 months ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 147

upvoted a paper 7 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 66

upvoted an article 7 months ago

Article

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

Mar 18, 2024

• 6

upvoted 2 collections 9 months ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 92