Jenish-23

jenish2014

AI & ML interests

Personal and Study

Recent Activity

new activity 18 days ago

OpenGVLab/InternVL2_5-4B-AWQ: Is it possible to use this model with huggingface's transformers library?

Organizations

None yet

Jenish-23's activity

upvoted 4 papers 11 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20, 2024 • 95

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15, 2024 • 36

upvoted a collection 12 months ago

Pretrained Text-Generation Models Below 250M Parameters

Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. • 9 items • Updated Dec 3, 2024 • 7

upvoted a paper about 1 year ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 90

upvoted 2 collections about 1 year ago

Small_Language_Models

23 items • Updated Feb 1, 2024 • 1

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 17