-
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 181 -
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
Paper • 2310.02960 • Published • 1 -
microsoft/phi-2
Text Generation • Updated • 155k • 3.26k
Bernd Schickerbauer
Beschick
AI & ML interests
None yet
Recent Activity
liked
a model
about 2 months ago
vidore/colpali-v1.2
liked
a model
about 2 months ago
vidore/colpali
upvoted
an
article
about 2 months ago
ColPali: Efficient Document Retrieval with Vision Language Models 👀
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet