8 16 1

Daniel Korat

danielkorat

AI & ML interests

Inference acceleration, Low-resource NLP, Few-shot learning

Recent Activity

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

updated a dataset about 1 month ago

huggingface/documentation-images

new activity about 1 month ago

huggingface/documentation-images:Upload method-animation.mov

View all activity

Articles

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30, 2024

• 9

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Dec 6, 2023

• 6

SetFit: Efficient Few-Shot Learning Without Prompts

Sep 26, 2022

• 21

Organizations

danielkorat's activity

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 9

updated a dataset about 1 month ago

huggingface/documentation-images

Viewer • Updated 3 days ago • 50 • 2.19M • 45

New activity in huggingface/documentation-images about 1 month ago

Upload method-animation.mov

#394 opened about 1 month ago by

danielkorat

New activity in huggingface/documentation-images 3 months ago

Upload method-animation.mov

#375 opened 3 months ago by

danielkorat

upvoted 3 articles 3 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 38

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Apr 3, 2024

• 10

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8, 2024

• 44

upvoted an article 5 months ago

Article

SetFit: Efficient Few-Shot Learning Without Prompts

Sep 26, 2022

• 21

upvoted a paper 5 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 35

upvoted an article 6 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 49

upvoted an article 7 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 167

upvoted 2 papers 7 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 16

authored a paper 7 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

authored a paper 8 months ago

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 16

New activity in lmsys/vicuna-13b-v1.3 8 months ago

Adding `safetensors` variant of this model

#5 opened 9 months ago by

SFconvertbot

New activity in lmsys/vicuna-7b-v1.3 8 months ago

Adding `safetensors` variant of this model

#4 opened 12 months ago by

SFconvertbot

New activity in bigcode/starcoder 8 months ago

Adding `safetensors` variant of this model

#112 opened 9 months ago by

SFconvertbot

upvoted 2 articles 8 months ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

May 9, 2024

• 12

Article

Introducing the Open Leaderboard for Hebrew LLMs!

May 5, 2024

• 32

Daniel Korat

AI & ML interests

Recent Activity

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model

Faster Assisted Generation with Dynamic Speculation

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

SetFit: Efficient Few-Shot Learning Without Prompts

Organizations

danielkorat's activity

Upload method-animation.mov

Upload method-animation.mov

Assisted Generation: a new direction toward low-latency text generation

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Faster Assisted Generation with Dynamic Speculation

SetFit: Efficient Few-Shot Learning Without Prompts

Our Transformers Code Agent beats the GAIA benchmark!

Training and Finetuning Embedding Models with Sentence Transformers v3

Adding `safetensors` variant of this model

Adding `safetensors` variant of this model

Adding `safetensors` variant of this model

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Introducing the Open Leaderboard for Hebrew LLMs!