Arash Ahmadian

ArashAhmadian

aahmadian_

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

View all activity

Articles

A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality

Oct 24, 2024

• 60

Putting RL back in RLHF

Jun 12, 2024

• 66

Organizations

ArashAhmadian's activity

authored a paper 19 days ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Paper • 2412.04144 • Published Dec 5, 2024 • 4

authored 2 papers 6 months ago

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

Paper • 2407.02552 • Published Jul 2, 2024 • 4

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Paper • 2309.05444 • Published Sep 11, 2023 • 1

updated 3 models 7 months ago

upvoted a paper 7 months ago

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 18

updated 4 models 7 months ago

ArashAhmadian/ppo_6.9b_new

Text Generation • Updated Jun 7, 2024 • 23

ArashAhmadian/rloo_6.9b_new

Text Generation • Updated Jun 7, 2024 • 23

ArashAhmadian/rloo_7b_f

Feature Extraction • Updated Jun 6, 2024 • 13

ArashAhmadian/ppo_rloo_bp_7b

Feature Extraction • Updated Jun 6, 2024 • 13

authored a paper 7 months ago

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 18

updated 4 models 7 months ago

ArashAhmadian/rloo_tldr_6.9b_defaultclip_512bs_05kl

Text Generation • Updated Jun 4, 2024 • 19

ArashAhmadian/rloo_tldr_6.9b_noratioclip

Text Generation • Updated Jun 1, 2024 • 24

ArashAhmadian/rloo_tldr_6.9b_ds2

Text Generation • Updated May 30, 2024 • 21

ArashAhmadian/rloo_tldr_6.9b

Text Generation • Updated May 27, 2024 • 18

authored a paper 7 months ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 12

upvoted a paper 7 months ago

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9, 2024 • 54

updated a model 8 months ago

ArashAhmadian/bb_repro

Text Generation • Updated May 24, 2024 • 24