Personal Workspace
Preview
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Tag preview
3 preview projects.
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate