Shyam Sunder Kumar's picture

Shyam Sunder Kumar

theainerd

·

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 1 day ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

liked a model 1 day ago

Qwen/Qwen2.5-Coder-32B-Instruct

upvoted a collection 1 day ago

Google's Gemma models family

View all activity

Organizations

theainerd's activity

upvoted a paper 1 day ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 71

upvoted a collection 1 day ago

Google's Gemma models family

243 items • Updated 24 days ago • 61

upvoted a paper 3 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 34

upvoted a collection 4 days ago

🤖 Agents

21 items • Updated 6 days ago • 80

upvoted a paper 7 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 12 days ago • 86

upvoted a collection 20 days ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated about 18 hours ago • 19

upvoted a paper 20 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 54

upvoted a paper 25 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 26 days ago • 97

upvoted a paper 27 days ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 28 days ago • 68

upvoted a collection 29 days ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 17 items • Updated 15 days ago • 93

upvoted a collection 4 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 15 days ago • 208

upvoted a paper 5 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 76

upvoted an article 5 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 260

upvoted a paper 5 months ago

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Paper • 2407.18961 • Published Jul 18, 2024 • 40

upvoted 2 collections 5 months ago

Research projects on top of vLLM

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29, 2024 • 12

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 638

upvoted 3 collections 6 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated 26 days ago • 38

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 69

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 354

upvoted a paper 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160