mobinx
's Collections
AIPAPER
updated
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
Synthesis
Paper
•
2402.14797
•
Published
•
20
Subobject-level Image Tokenization
Paper
•
2402.14327
•
Published
•
17
MobileLLM: Optimizing Sub-billion Parameter Language Models for
On-Device Use Cases
Paper
•
2402.14905
•
Published
•
126
GPTVQ: The Blessing of Dimensionality for LLM Quantization
Paper
•
2402.15319
•
Published
•
19
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
605
When Scaling Meets LLM Finetuning: The Effect of Data, Model and
Finetuning Method
Paper
•
2402.17193
•
Published
•
23
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Paper
•
2403.02775
•
Published
•
11
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper
•
2403.03507
•
Published
•
183
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and
Diffusion Models
Paper
•
2403.03100
•
Published
•
34
LLM Agent Operating System
Paper
•
2403.16971
•
Published
•
65
CodexGraph: Bridging Large Language Models and Code Repositories via
Code Graph Databases
Paper
•
2408.03910
•
Published
•
16
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Paper
•
2408.06072
•
Published
•
37
Paper
•
2408.05366
•
Published
•
12
Transfusion: Predict the Next Token and Diffuse Images with One
Multi-Modal Model
Paper
•
2408.11039
•
Published
•
58
TVG: A Training-free Transition Video Generation Method with Diffusion
Models
Paper
•
2408.13413
•
Published
•
14
MobileQuant: Mobile-friendly Quantization for On-device Language Models
Paper
•
2408.13933
•
Published
•
14
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs
Paper
•
2409.05152
•
Published
•
31
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with
100+ NLP Researchers
Paper
•
2409.04109
•
Published
•
44
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning
Paper
•
2410.01044
•
Published
•
34
Aligning Large Language Models via Self-Steering Optimization
Paper
•
2410.17131
•
Published
•
21
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis
from Scratch
Paper
•
2410.18693
•
Published
•
40
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum
Reinforcement Learning
Paper
•
2411.02337
•
Published
•
35
Natural Language Reinforcement Learning
Paper
•
2411.14251
•
Published
•
28
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper
•
2411.14405
•
Published
•
58
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's
Reasoning Capability
Paper
•
2411.19943
•
Published
•
56
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital
World
Paper
•
2412.17589
•
Published
•
12
Ensembling Large Language Models with Process Reward-Guided Tree Search
for Better Complex Reasoning
Paper
•
2412.15797
•
Published
•
16
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via
Collective Monte Carlo Tree Search
Paper
•
2412.18319
•
Published
•
34