Collections

Collections including paper arxiv:2412.03187

Weighted-Reward Preference Optimization for Implicit Model Fusion

Paper • 2412.03187 • Published Dec 4, 2024 • 9

Preference Optimization for Implicit Model Fusion

Weighted-Reward Preference Optimization for Implicit Model Fusion

Paper • 2412.03187 • Published Dec 4, 2024 • 9
FuseAI/FuseChat-Llama-3.1-8B-Instruct

Updated 26 days ago • 211 • 7
FuseAI/FuseChat-Llama-3.2-3B-Instruct

Updated 26 days ago • 159 • 3
FuseAI/FuseChat-Llama-3.2-1B-Instruct

Updated 26 days ago • 436 • 4

bitext/Bitext-travel-llm-chatbot-training-dataset

Viewer • Updated Aug 22, 2024 • 31.7k • 61
alexlawtengyi/travel_agentv1

Viewer • Updated Nov 22, 2024 • 691 • 29 • 1
yananchen/travelplanner_faft_filter_label45_pos517_neg1959

Viewer • Updated Nov 18, 2024 • 2k • 30
osunlp/TravelPlanner

Viewer • Updated Jul 14, 2024 • 1.23k • 2.6k • 47

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 19
Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 19
Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 15

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 145
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 12
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 53
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 45
