Semantic cache for LLMs. Fully integrated with LangChain
Supercharge Your LLM with the Fastest KV Cache Layer
Java wrapper for the popular chat & VOIP service
RGBD video generation model conditioned on camera input
FlashMLA: Efficient Multi-head Latent Attention Kernels
A Model Context Protocol (MCP) Gateway & Registry
Unofficial .Net Client for ChatGPT
Bring the notion of Model-as-a-Service to life
VITS2 backbone with multilingual-bert
Fully private LLM chatbot that runs entirely with a browser
simplest AI programme of tic-tac-toe game
A reactive runtime for building durable AI agents
A webui for different audio related Neural Networks
Snipe Chan is a Discord Bot that snipes deleted/edited messages
Chat with LLM like Vicuna totally in your browser with WebGPU
A modern, web-based photo management server
fastNLP: A Modularized and Extensible NLP Framework
A full-featured home hosted Cloud Drive, Personal Assistant, App Launc
High-performance MoE model with MLA, MTP, and multilingual reasoning