Skip to content
View yunwei37's full-sized avatar
🏠
Working from home
🏠
Working from home

Sponsors

@lazyparser
Private Sponsor

Organizations

@eunomia-bpf

Block or report yunwei37

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An eBPF based tool for implementing socket based communication to shared memory

C 4 Updated Jun 26, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,494 483 Updated Dec 27, 2025

Inspect a command's effects before modifying your live system

Shell 5,377 76 Updated Dec 28, 2025

AgentOS Workshop for ASPLOS'26

HTML 4 Updated Dec 16, 2025

Userspace eBPF instrumentation and control framework for deploying control and monitoring functions in a secure manner

C 69 18 Updated Aug 19, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 453 34 Updated May 30, 2025

GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.

C++ 38 9 Updated Mar 24, 2024

eBPF for GPU UVM offloading and scheduling in Linux kernel

C 11 Updated Dec 19, 2025

monolith modernizer

Go 5 Updated Jul 30, 2025

System Intelligence Benchmark

TLA 37 5 Updated Dec 22, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 38,549 4,167 Updated Dec 26, 2025

AI Observability & Evaluation

Jupyter Notebook 8,049 661 Updated Dec 27, 2025

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

C++ 66 6 Updated May 1, 2024

cache_ext is a framework to customize Linux page cache eviction policies using BPF. Appeared in SOSP 2025.

Jupyter Notebook 61 12 Updated Dec 12, 2025

DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.

C++ 57 7 Updated Aug 21, 2024

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,301 180 Updated Dec 17, 2025
Jupyter Notebook 17 2 Updated May 18, 2025

A language and library for specifying syscall filtering policies.

C 341 53 Updated Nov 22, 2025

cvc5 is an open-source automatic theorem prover for Satisfiability Modulo Theories (SMT) problems.

SMT 1,238 271 Updated Dec 22, 2025
C++ 776 127 Updated Oct 29, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,373 617 Updated Dec 28, 2025
C++ 610 104 Updated Dec 20, 2025

Python SDK for ACP clients and agents.

Python 107 7 Updated Dec 27, 2025

This is the public repo for DREAM accepted to ICS 2025.

C++ 8 1 Updated Dec 28, 2025

Artifacts for our NSDI'23 paper TGS

Python 95 28 Updated Jun 10, 2024

GeminiFS: A Companion File System for GPUs

C++ 69 11 Updated Feb 18, 2025

Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]

Cuda 22 4 Updated Jan 6, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,012 162 Updated Dec 20, 2025
Next