26 11 63

Ivan Fioravanti PRO

ivanfioravanti

AI & ML interests

None yet

Recent Activity

liked a dataset about 19 hours ago

cognitivecomputations/OpenCoder-LLM_opc-sft-stage2-labeled

liked a dataset about 19 hours ago

cognitivecomputations/OpenCoder-LLM_opc-sft-stage1-labeled

upvoted an article 2 days ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

View all activity

Organizations

ivanfioravanti's activity

upvoted an article 2 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

•

2 days ago

• 27

upvoted a paper 6 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published 20 days ago • 46

upvoted 3 papers 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 18 days ago • 116

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 16 days ago • 334

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published 19 days ago • 41

upvoted an article 26 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

•

Dec 4, 2024

• 75

upvoted an article about 2 months ago

Article

Releasing the largest multilingual open pretraining dataset

•

Nov 13, 2024

• 98

upvoted an article 8 months ago

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

•

Apr 29, 2024

• 29

upvoted an article 9 months ago

Article

RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled

•

Apr 7, 2024

• 10