-
Meta-Learning a Dynamical Language Model
Paper • 1803.10631 • Published -
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation
Paper • 2003.11963 • Published -
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Paper • 2212.04960 • Published • 1 -
Continuous Learning in a Hierarchical Multiscale Neural Network
Paper • 1805.05758 • Published • 1
stark
blizzard-neel
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
updated
a collection
2 days ago
papers
upvoted
a
paper
2 days ago
Xmodel-2 Technical Report
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet