Dmitry Tishencko

DiTy

AI & ML interests

NLP/NLG, Language Modeling, Metric Learning

Recent Activity

liked a dataset 16 days ago

reciTAL/mlsum

new activity 30 days ago

DiTy/gemma-2-9b-it-russian-strict-function-calling-DPO:Multi-function-calling

updated a model about 1 month ago

DiTy/gemma-2-9b-it-russian-function-calling-GGUF

DiTy's activity

upvoted 2 papers about 2 months ago

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Paper • 2409.03215 • Published Sep 5, 2024 • 4

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Paper • 2402.15506 • Published Feb 23, 2024 • 14

upvoted 2 articles 3 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • Dec 9, 2022 • 124

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 133

upvoted a collection 6 months ago

Gemma 2 Release

15 items • Updated 25 days ago • 206

upvoted a collection 7 months ago

C4AI Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Dec 3, 2024 • 51

upvoted a paper 7 months ago

Item-Language Model for Conversational Recommendation

Paper • 2406.02844 • Published Jun 5, 2024 • 9

upvoted a paper 9 months ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 65

upvoted a paper 11 months ago

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 44

upvoted a paper over 1 year ago

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 40