YanshekWoo commited on
Commit
685312f
·
verified ·
1 Parent(s): ca4dcbe
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -20127,9 +20127,10 @@ KaLM-embedding-multilingual-mini is trained from [Qwen/Qwen2-0.5B](https://huggi
20127
  - [x] Model Checkpoint
20128
  - [x] [KaLM-embedding-multilingual-mini-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-v1)
20129
  - [x] [KaLM-embedding-multilingual-mini-instruct-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1)
 
20130
  - [ ] KaLM-embedding-multilingual-max-v1
20131
- - [ ] Technical Report
20132
- - [ ] Training and Evaluation Code
20133
  - [ ] Training Data
20134
 
20135
 
@@ -20141,7 +20142,8 @@ KaLM-embedding-multilingual-mini is trained from [Qwen/Qwen2-0.5B](https://huggi
20141
  | [bge-m3 (dense)](https://huggingface.co/BAAI/bge-m3) | 560M | 60.80 | 59.84 | 60.32
20142
  | [gte-multilingual-base (dense)](https://huggingface.co/Alibaba-NLP/gte-multilingual-base) | **305M** | 62.72 | 61.40 | 62.06
20143
  | [KaLM-embedding-multilingual-mini-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-v1) | 494M | 62.31 | 61.87 | 62.09
20144
- | [KaLM-embedding-multilingual-mini-instruct-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1) | 494M | **63.57** | **64.74** | **64.16**
 
20145
 
20146
 
20147
 
 
20127
  - [x] Model Checkpoint
20128
  - [x] [KaLM-embedding-multilingual-mini-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-v1)
20129
  - [x] [KaLM-embedding-multilingual-mini-instruct-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1)
20130
+ - [x] [KaLM-embedding-multilingual-mini-instruct-v1.5](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5)
20131
  - [ ] KaLM-embedding-multilingual-max-v1
20132
+ - [x] Training and Evaluation Code: [HITsz-TMG/KaLM-Embedding](https://github.com/HITsz-TMG/KaLM-Embedding)
20133
+ - [x] Technical Report: [KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model](https://arxiv.org/abs/2501.01028)
20134
  - [ ] Training Data
20135
 
20136
 
 
20142
  | [bge-m3 (dense)](https://huggingface.co/BAAI/bge-m3) | 560M | 60.80 | 59.84 | 60.32
20143
  | [gte-multilingual-base (dense)](https://huggingface.co/Alibaba-NLP/gte-multilingual-base) | **305M** | 62.72 | 61.40 | 62.06
20144
  | [KaLM-embedding-multilingual-mini-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-v1) | 494M | 62.31 | 61.87 | 62.09
20145
+ | [KaLM-embedding-multilingual-mini-instruct-v1](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1) | 494M | 63.57 | 64.74 | 64.16
20146
+ | [KaLM-embedding-multilingual-mini-instruct-v1.5](https://huggingface.co/HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5) | 494M | **64.13** | **64.94** | **64.53**
20147
 
20148
 
20149