lsxi77777 committed on
Commit 5032e08 · 1 Parent(s): 13d79ca

Upload Models

Files changed (5)
  1. .gitignore +1 -0
  2. README.md +17 -0
  3. minima_lightglue.pth +3 -0
  4. minima_loftr.ckpt +3 -0
  5. minima_roma.pth +3 -0
.gitignore ADDED
@@ -0,0 +1 @@
+ .idea
README.md CHANGED
@@ -1,3 +1,20 @@
  ---
  license: apache-2.0
  ---
+
+ ## Abstract
+
+ Image matching for both cross-view and cross-modality plays a critical role in multimodal perception. In practice, the
+ modality gap caused by different imaging systems/styles poses great challenges to the matching task. Existing works try
+ to extract invariant features for specific modalities and train on limited datasets, showing poor generalization. In
+ this paper, we present MINIMA, a unified image matching framework for multiple cross-modal cases. Without pursuing fancy
+ modules, our MINIMA aims to enhance universal performance from the perspective of data scaling up. For such purpose, we
+ propose a simple yet effective data engine that can freely produce a large dataset containing multiple modalities, rich
+ scenarios, and accurate matching labels. Specifically, we scale up the modalities from cheap but rich RGB-only matching
+ data, by means of generative models. Under this setting, the matching labels and rich diversity of the RGB dataset are
+ well inherited by the generated multimodal data. Benefiting from this, we construct MD-syn, a new comprehensive dataset
+ that fills the data gap for general multimodal image matching. With MD-syn, we can directly train any advanced matching
+ pipeline on randomly selected modality pairs to obtain cross-modal ability. Extensive experiments on in-domain and
+ zero-shot matching tasks, including 19 cross-modal cases, demonstrate that our MINIMA can significantly outperform the
+ baselines and even surpass modality-specific methods. The dataset and code are available
+ at https://github.com/LSXI7/MINIMA .
minima_lightglue.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
+ size 47484144
minima_loftr.ckpt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:810d19773ff898ba04a68c99a3eff9c112210bf884214bd76aec885e83b0e257
+ size 46350788
minima_roma.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:17f3923bd780e8f1450792706e0643f70c8864ad11bf5fd4f2b54714bac23538
+ size 445638463
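
Note: the three model files above are stored as Git LFS pointers, each recording a spec version, a SHA-256 object id, and a size in bytes. A downloaded checkpoint can be checked against its pointer. The following is a minimal sketch, not part of this commit; the function names `parse_lfs_pointer` and `verify_against_pointer` and the constant `LIGHTGLUE_POINTER` are hypothetical helpers introduced here for illustration, and the local file path is an assumption.

```python
import hashlib
import os

# Pointer contents for minima_lightglue.pth, copied from the diff above.
LIGHTGLUE_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
size 47484144
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form '<hash-algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_against_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    if pointer["algo"] != "sha256":
        raise ValueError("unsupported hash algorithm: " + pointer["algo"])
    # Cheap check first: a size mismatch means the hash cannot match either.
    if os.path.getsize(path) != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large checkpoints are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["digest"]
```

Assuming the checkpoint has been downloaded to the working directory, `verify_against_pointer("minima_lightglue.pth", parse_lfs_pointer(LIGHTGLUE_POINTER))` should return `True` for an intact file. The same check applies to `minima_loftr.ckpt` and `minima_roma.pth` with their respective pointer contents.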