zhentaoyu

🎯 Focusing

2 followers · 14 following

intel
Shanghai
densecollections.top

Pinned

intel/neural-speed (Public archive)

An innovative library for efficient LLM inference via low-bit quantization

C++ · 342 stars · 35 forks

intel/intel-extension-for-transformers (Public)

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python · 2.1k stars · 207 forks

ggerganov/llama.cpp (Public)

LLM inference in C/C++

C++ · 65.1k stars · 9.3k forks

leejet/stable-diffusion.cpp (Public)

Stable Diffusion and Flux in pure C/C++

C++ · 3.3k stars · 273 forks

huggingface/optimum-habana (Public)

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python · 146 stars · 183 forks

intel/neural-compressor (Public)

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python · 2.2k stars · 252 forks