Motoki Wu's picture

Motoki Wu

tokestermw

·

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

infgrad/jasper_en_vision_language_v1

liked a model 12 days ago

deepseek-ai/DeepSeek-V3-Base

liked a Space 13 days ago

osanseviero/gemini-coder

View all activity

Organizations

tokestermw's activity

upvoted a paper 17 days ago

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 25

upvoted a paper 18 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 18 days ago • 337

upvoted a paper 21 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 24 days ago • 136

upvoted a paper 30 days ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 41

upvoted a collection about 1 month ago

LLaMA-O1-1129 Datasets, Models, Codes and Papers

8 items • Updated Dec 3, 2024 • 18

upvoted 2 collections 2 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 21 days ago • 30

upvoted a paper 2 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

upvoted an article 3 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 50

upvoted a paper 3 months ago

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

Paper • 2409.12941 • Published Sep 19, 2024 • 23

upvoted a collection 3 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225

upvoted an article 3 months ago

Article

Document Similarity Search with ColPali

By

•

Sep 21, 2024

• 48

upvoted 8 papers 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

Paper • 2409.05152 • Published Sep 8, 2024 • 31

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 32

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Paper • 2402.10110 • Published Feb 15, 2024 • 3

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 43

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published Sep 4, 2024 • 45

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 138

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Paper • 2408.15915 • Published Aug 28, 2024 • 19