Open Platform for Enterprise AI

non-profit

https://opea.dev/

AI & ML interests

Enterprise AI, RAG, GenAI

OPEA's activity

cicdatopea

in OPEA/deepseek-vl2-int4-sym-gptq-inc about 13 hours ago

ValueError: Invalid modules, at least two modules detected as dependent, {shortest_module} and {longest_module}

#3 opened 4 days ago

cicdatopea

updated a model about 13 hours ago

OPEA/DeepSeek-V3-int4-sym-gptq-inc

Updated about 13 hours ago • 419 • 10

cicdatopea

updated a model about 14 hours ago

OPEA/DeepSeek-V3-int4-sym-awq-inc-cpu

Updated about 14 hours ago • 7

cicdatopea

updated a model about 18 hours ago

OPEA/deepseek-vl2-int4-sym-gptq-inc

Updated about 18 hours ago • 93

cicdatopea

updated a model about 21 hours ago

OPEA/DeepSeek-V2.5-1210-int4-sym-inc

Updated about 21 hours ago • 66 • 7

cicdatopea

in OPEA/DeepSeek-V3-int4-sym-gptq-inc about 21 hours ago

vllm

#4 opened 5 days ago

cicdatopea

updated a model 3 days ago

OPEA/llama-joycaption-alpha-two-hf-llava-int4-sym-inc

Updated 3 days ago • 234 • 1

cicdatopea

in OPEA/DeepSeek-V3-int4-sym-gptq-inc 5 days ago

engine

#3 opened 5 days ago

Base model please!

#2 opened 5 days ago

Haihao

authored a paper about 1 month ago

A dynamic parallel method for performance optimization on hybrid CPUs

Paper • 2411.19542 • Published Nov 29, 2024 • 5

Haihao

authored 3 papers 3 months ago

Efficient LLM Inference on CPUs

Paper • 2311.00502 • Published Nov 1, 2023 • 7

Effective Quantization for Diffusion Models on CPUs

Paper • 2311.16133 • Published Nov 2, 2023 • 4

Fast DistilBERT on CPUs

Paper • 2211.07715 • Published Oct 27, 2022

ibrahimhaddad

authored 2 papers 6 months ago

The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence

Paper • 2403.13784 • Published Mar 20, 2024

CLAIMED -- the open source framework for building coarse-grained operators for accelerated discovery in science

Paper • 2307.06824 • Published Jul 12, 2023

ashahba

authored a paper 9 months ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 10

Haihao

authored a paper about 1 year ago

TEQ: Trainable Equivalent Transformation for Quantization of LLMs

Paper • 2310.10944 • Published Oct 17, 2023 • 9

Haihao

authored 3 papers over 1 year ago

Efficient Post-training Quantization with FP8 Formats

Paper • 2309.14592 • Published Sep 26, 2023 • 10

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Paper • 2309.05516 • Published Sep 11, 2023 • 9

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

Paper • 2306.16601 • Published Jun 28, 2023 • 4