Upload Models
- .gitignore +1 -0
- README.md +17 -0
- minima_lightglue.pth +3 -0
- minima_loftr.ckpt +3 -0
- minima_roma.pth +3 -0
.gitignore
ADDED
@@ -0,0 +1 @@
+.idea
README.md
CHANGED
@@ -1,3 +1,20 @@
 ---
 license: apache-2.0
 ---
+
+## Abstract
+
+Image matching for both cross-view and cross-modality plays a critical role in multimodal perception. In practice, the
+modality gap caused by different imaging systems/styles poses great challenges to the matching task. Existing works try
+to extract invariant features for specific modalities and train on limited datasets, showing poor generalization. In
+this paper, we present MINIMA, a unified image matching framework for multiple cross-modal cases. Without pursuing fancy
+modules, our MINIMA aims to enhance universal performance from the perspective of data scaling up. For such purpose, we
+propose a simple yet effective data engine that can freely produce a large dataset containing multiple modalities, rich
+scenarios, and accurate matching labels. Specifically, we scale up the modalities from cheap but rich RGB-only matching
+data, by means of generative models. Under this setting, the matching labels and rich diversity of the RGB dataset are
+well inherited by the generated multimodal data. Benefiting from this, we construct MD-syn, a new comprehensive dataset
+that fills the data gap for general multimodal image matching. With MD-syn, we can directly train any advanced matching
+pipeline on randomly selected modality pairs to obtain cross-modal ability. Extensive experiments on in-domain and
+zero-shot matching tasks, including 19 cross-modal cases, demonstrate that our MINIMA can significantly outperform the
+baselines and even surpass modality-specific methods. The dataset and code are available
+at https://github.com/LSXI7/MINIMA .
minima_lightglue.pth
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
+size 47484144
minima_loftr.ckpt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:810d19773ff898ba04a68c99a3eff9c112210bf884214bd76aec885e83b0e257
+size 46350788
minima_roma.pth
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:17f3923bd780e8f1450792706e0643f70c8864ad11bf5fd4f2b54714bac23538
+size 445638463
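The three checkpoint entries in this commit are Git LFS pointer files: each records a spec version, a SHA-256 `oid`, and a byte `size` rather than the weights themselves. A minimal sketch of how a downloaded checkpoint could be checked against its pointer follows. The helper names `parse_lfs_pointer` and `verify_against_pointer` are hypothetical (they are not part of this repository or of Git LFS), and the local file path in the usage note is an assumption.

```python
import hashlib
import os
import tempfile  # used only by the usage/test snippets, not by the helpers


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a dict with 'version', 'oid', 'size'."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid hash: {algo}")
    return {
        "version": fields["version"],
        "oid": digest,
        "size": int(fields["size"]),
    }


def verify_against_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so large checkpoints are never loaded fully into memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["oid"]


# Pointer contents copied from the minima_lightglue.pth entry in this commit.
LIGHTGLUE_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
size 47484144
"""
```

Assuming the weights were downloaded to `minima_lightglue.pth` in the working directory, `verify_against_pointer("minima_lightglue.pth", parse_lfs_pointer(LIGHTGLUE_POINTER))` would return `True` only if both the size and the hash match. The size check runs first because it is cheap and catches truncated downloads before any hashing is done.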