Yifan Zeng

yokey

https://xhmy.github.io/

AI & ML interests

Large Language Model, Agentic AI, Deep Learning

Organizations

None yet

yokey's activity

liked a model 10 days ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • Updated Oct 14, 2024 • 6.16k • 52

upvoted a paper 12 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 15 days ago • 44

upvoted a paper about 2 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

New activity in google/gemma-2-9b about 2 months ago

RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.

#24 opened 6 months ago by saireddy

upvoted 2 papers 2 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 77

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 17

authored a paper 3 months ago

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Paper • 2410.16033 • Published Oct 18, 2024

liked a model 3 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25, 2024 • 281k • 1.97k

commented on a paper 3 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17, 2024 • 3

authored 2 papers 3 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17, 2024 • 3

LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking

Paper • 2406.00231 • Published May 31, 2024

upvoted a paper 3 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17, 2024 • 3

updated a collection 3 months ago

LLM

Collection

19 items • Updated Oct 17, 2024

liked a model 3 months ago

openai-community/gpt2

Text Generation • Updated Feb 19, 2024 • 7.9M • 2.48k

upvoted a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

updated 2 collections 4 months ago

LLM

Collection

19 items • Updated Oct 17, 2024

AI4Sci

Collection

1 item • Updated Sep 14, 2024