Konstantinos Kakkavas

kkakkavas

kkakkavas

AI & ML interests

- NLP - CV - docVQA

Recent Activity

upvoted a paper 11 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

upvoted a paper 11 days ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

liked a model about 2 months ago

Xkev/Llama-3.2V-11B-cot

View all activity

Organizations

kkakkavas's activity

upvoted 2 papers 11 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 17 days ago • 95

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 11 days ago • 48

liked a model about 2 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 7.57k • 142

liked a Space 4 months ago

Sleeping

📉

KIE Engines Comparison

updated a Space 4 months ago

Sleeping

📉

KIE Engines Comparison

liked 2 models 6 months ago

bartowski/Meta-Llama-3-8B-Instruct-GGUF

Text Generation • Updated Apr 29, 2024 • 3.18k • 93

naver-clova-ocr/bros-base-uncased

Feature Extraction • Updated Apr 5, 2022 • 29.3k • 18

liked a dataset 6 months ago

lmms-lab/DocVQA

Viewer • Updated Apr 18, 2024 • 16.6k • 9.94k • 29

liked a Space 6 months ago

Running

📚

Groq-LLaMA3.x

Groq & Llama3.x updated

updated a Space 6 months ago

Runtime error

🌍

Sennodipoi LayoutLMv3 KleisterNDA

liked a Space 7 months ago

Runtime error

159

📚

DocOwl

upvoted a collection 7 months ago

Table Transformer

Collection

The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated 10 days ago • 20

liked 2 models 7 months ago

mPLUG/DocOwl1.5

Updated Apr 10, 2024 • 53 • 26

JinghuiLuAstronaut/DocLLM_baichuan2_7b

Text Generation • Updated Feb 29, 2024 • 21 • 4

updated a Space 8 months ago

No application file

🏃

Konstantinos Kakkavas

AI & ML interests

Recent Activity

Organizations

kkakkavas's activity

KIE Engines Comparison

KIE Engines Comparison

Groq-LLaMA3.x

Sennodipoi LayoutLMv3 KleisterNDA

DocOwl

README