23 10 4

Kalle Hilsenbek

Bachstelze

https://bachstelze.gitlab.io/multisource/

Bachstelze

AI & ML interests

Combining BERT with instructions for explainable AI: gitlab.com/Bachstelze/instructionbert

Recent Activity

new activity 19 days ago

Nart/monolingual_ab:Goldfish model

new activity about 2 months ago

HuggingFaceTB/SmolLM2-360M-Instruct:Adding Evaluation Results

View all activity

Organizations

None yet

Bachstelze's activity

New activity in Nart/monolingual_ab 19 days ago

Goldfish model

#5 opened 19 days ago by

Bachstelze

New activity in HuggingFaceTB/SmolLM2-360M-Instruct about 2 months ago

Adding Evaluation Results

#6 opened about 2 months ago by

leaderboard-pr-bot

liked a dataset 2 months ago

KevinZ/oLMpics

Viewer • Updated Apr 19, 2022 • 38.3k • 37 • 1

upvoted a paper 3 months ago

Large Language Model Evaluation via Matrix Nuclear-Norm

Paper • 2410.10672 • Published Oct 14, 2024 • 19

New activity in HuggingFaceTB/SmolLM-135M 3 months ago

Benchmark results

#17 opened 3 months ago by

Bachstelze

commented a paper 3 months ago

Emergent properties with repeated examples

Paper • 2410.07041 • Published Oct 9, 2024 • 8 •

upvoted 2 papers 3 months ago

One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

Paper • 2410.07170 • Published Oct 9, 2024 • 15

Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization

Paper • 2410.04717 • Published Oct 7, 2024 • 18

commented a paper 3 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169 •

upvoted a paper 3 months ago

Cottention: Linear Transformers With Cosine Attention

Paper • 2409.18747 • Published Sep 27, 2024 • 16

commented a paper 3 months ago

Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction

Paper • 2409.17422 • Published Sep 25, 2024 • 25 •

upvoted a paper 3 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

commented a paper 3 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26 •

New activity in Slim205/mmlu_ift 4 months ago

Readme

#1 opened 4 months ago by

Bachstelze

commented 2 papers 4 months ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27, 2024 • 52 •

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 42 •

commented a paper 5 months ago

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8, 2024 • 15 •

upvoted a paper 5 months ago

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8, 2024 • 15

liked a dataset 5 months ago

gsarti/eureka-rebus

Viewer • Updated Sep 17, 2024 • 307k • 30 • 1

commented a paper 5 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 76 •