Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
84
10
15
Guilherme Penedo
guipenedo
Follow
Odyssey33's profile picture
nndr72's profile picture
shollyking's profile picture
743 followers
·
6 following
gui_penedo
guipenedo
AI & ML interests
None yet
Articles
FineWeb2-C: Help Build Better Language Models in Your Language
14 days ago
•
11
Organizations
guipenedo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
30 days ago
HuggingFaceFW/fineweb-2
Viewer
•
Updated
30 days ago
•
13.8B
•
110k
•
383
liked
a Space
about 1 month ago
Running
34
💬
Discussion Forum
liked
a model
2 months ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
Dec 4, 2024
•
88.2k
•
458
liked
2 Spaces
3 months ago
Running
50
📝
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
Running
93
📖
TxT360: Trillion Extracted Text
liked
a model
3 months ago
cis-lmu/glotlid
Text Classification
•
Updated
Oct 26, 2024
•
7.06k
•
52
liked
a dataset
3 months ago
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
25.4k
•
826
liked
a Space
5 months ago
Running
366
🧽
Finegrain Object Eraser
Erase any object just by naming it!
liked
3 models
6 months ago
HuggingFaceTB/SmolLM-1.7B
Text Generation
•
Updated
Oct 16, 2024
•
8.58k
•
164
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
Updated
Aug 18, 2024
•
51.5k
•
107
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14, 2024
•
2.58k
•
322
liked
a dataset
7 months ago
HuggingFaceFW/fineweb-edu
Updated
about 8 hours ago
•
193k
•
586
liked
a Space
7 months ago
Running
552
🍷
FineWeb: decanting the web for the finest text data at scale
liked
a dataset
9 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
3 days ago
•
48.6B
•
175k
•
1.81k
liked
a Space
about 1 year ago
Running
206
🚀
GPT Baker