Sugato Ray's picture

Sugato Ray

sugatoray

·

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 20 hours ago

Leaderboards 🔥

updated a collection about 20 hours ago

Papers-Benchmarks

liked a dataset about 20 hours ago

m-ric/agents_small_benchmark

View all activity

Organizations

sugatoray's activity

upvoted a paper 1 day ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 187

upvoted a collection 1 day ago

GAIA release

Gather the items of the GAIA release • 4 items • Updated Nov 23, 2023 • 20

upvoted a collection 2 days ago

🤖 Agents

21 items • Updated 6 days ago • 80

upvoted an article 2 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By

•

4 days ago

• 30

upvoted a collection 5 days ago

SwiftKV Models

SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 3 items • Updated Dec 5, 2024 • 3

upvoted a paper 5 days ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 10 days ago • 23

upvoted an article 5 days ago

Article

Fine-tune ModernBERT for text classification using synthetic data

By

•

7 days ago

• 19

upvoted a paper 6 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published 22 days ago • 49

upvoted a paper 10 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 13 days ago • 34

upvoted a paper 11 days ago

GUI Agents: A Survey

Paper • 2412.13501 • Published 20 days ago • 23

upvoted a collection 11 days ago

DeepSeek-V3

3 items • Updated about 20 hours ago • 98

upvoted a paper 12 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 13 days ago • 44

upvoted a collection 13 days ago

QVQ-72B-Preview

5 items • Updated 13 days ago • 7

upvoted an article 14 days ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 260

upvoted 2 papers 14 days ago

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 19 days ago • 13

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published 21 days ago • 10

upvoted a collection 14 days ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 54

upvoted a paper 14 days ago

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Paper • 2412.13194 • Published 20 days ago • 12

upvoted 2 papers 16 days ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 20 days ago • 91

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published 21 days ago • 6