lsxi77777/MINIMA


lsxi77777 committed 2 days ago

Commit 5032e08 · 1 Parent(s): 13d79ca

Upload Models

Files changed (5)

.gitignore +1 -0
README.md +17 -0
minima_lightglue.pth +3 -0
minima_loftr.ckpt +3 -0
minima_roma.pth +3 -0

.gitignore ADDED

@@ -0,0 +1 @@
+.idea

README.md CHANGED

@@ -1,3 +1,20 @@
 ---
 license: apache-2.0
 ---
+## Abstract
+Image matching for both cross-view and cross-modality plays a critical role in multimodal perception. In practice, the
+modality gap caused by different imaging systems/styles poses great challenges to the matching task. Existing works try
+to extract invariant features for specific modalities and train on limited datasets, showing poor generalization. In
+this paper, we present MINIMA, a unified image matching framework for multiple cross-modal cases. Rather than pursuing
+elaborate modules, MINIMA aims to improve universal performance by scaling up data. For this purpose, we
+propose a simple yet effective data engine that can freely produce a large dataset containing multiple modalities, rich
+scenarios, and accurate matching labels. Specifically, we scale up the modalities from cheap but rich RGB-only matching
+data, by means of generative models. Under this setting, the matching labels and rich diversity of the RGB dataset are
+well inherited by the generated multimodal data. Benefiting from this, we construct MD-syn, a new comprehensive dataset
+that fills the data gap for general multimodal image matching. With MD-syn, we can directly train any advanced matching
+pipeline on randomly selected modality pairs to obtain cross-modal ability. Extensive experiments on in-domain and
+zero-shot matching tasks, including 19 cross-modal cases, demonstrate that our MINIMA can significantly outperform the
+baselines and even surpass modality-specific methods. The dataset and code are available
+at https://github.com/LSXI7/MINIMA.
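
The abstract's idea of training on randomly selected modality pairs while inheriting labels from the RGB data can be sketched roughly as follows. This is only an illustration under assumptions: the modality names, the `rgb_pair` dictionary layout, and the independent per-image sampling are hypothetical and are not taken from the MINIMA code.

```python
import random

# Hypothetical modality names, for illustration only; the README does not list them.
MODALITIES = ["rgb", "infrared", "depth", "sketch"]


def sample_modality_pair(rng, modalities=MODALITIES):
    """Pick one modality per image of a pair (independent sampling is an assumption)."""
    return rng.choice(modalities), rng.choice(modalities)


def make_training_sample(rgb_pair, rng):
    """Build a cross-modal sample from an RGB pair.

    `rgb_pair` is a hypothetical dict with image identifiers under
    'image0' and 'image1' and matching labels under 'matches'.
    """
    m0, m1 = sample_modality_pair(rng)
    return {
        "image0": (rgb_pair["image0"], m0),  # which generated version of image 0 to load
        "image1": (rgb_pair["image1"], m1),  # which generated version of image 1 to load
        "matches": rgb_pair["matches"],      # labels are reused unchanged from the RGB pair
    }
```

The point of the sketch is that the labels never depend on the sampled modalities: the generated images share the geometry of their RGB sources, so the same correspondences apply.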

minima_lightglue.pth ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
+size 47484144

minima_loftr.ckpt ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:810d19773ff898ba04a68c99a3eff9c112210bf884214bd76aec885e83b0e257
+size 46350788

minima_roma.pth ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:17f3923bd780e8f1450792706e0643f70c8864ad11bf5fd4f2b54714bac23538
+size 445638463
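
The three files above are Git LFS pointers: each records the SHA-256 digest and byte size of the real checkpoint rather than the weights themselves. A downloaded checkpoint can be checked against its pointer with a few lines of standard-library Python. The sketch below is a minimal example, not part of the repository; the helper names `parse_lfs_pointer` and `verify_against_pointer` are made up for illustration, and the pointer text is copied from the `minima_lightglue.pth` hunk.

```python
import hashlib
import os
import tempfile

# Pointer content copied from the minima_lightglue.pth hunk (diff '+' markers removed).
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:bd981be2f5e8cdab120bb0380d132bf7d2ec20cd61499f92f283660bf2f5b126
size 47484144
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_against_pointer(path, pointer):
    """Return True if the file at `path` matches the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False  # cheap size check before hashing
    h = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so large checkpoints are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]
```

For example, `verify_against_pointer("minima_lightglue.pth", parse_lfs_pointer(POINTER))` should return `True` for an intact download, assuming the file was saved under that name in the current directory.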