yunwei37

🏠

Working from home

云微 yunwei37

🏠

Working from home

Stupid but Curious about the world | 系统算命师傅 | vibe coding 厕所清洁工 | eunomia.dev

1.1k followers · 910 following

Achievements

x3 x3 x3

Achievements

x3 x3 x3

Highlights

Developer Program Member
Pro

Organizations

Lists (1)

Sort

profiler

Starred repositories

Satyam-git-hub / Shared_socket

An eBPF based tool for implementing socket based communication to shared memory

C 4 Updated Jun 26, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,494 483 Updated Dec 27, 2025

binpash / try

Inspect a command's effects before modifying your live system

Shell 5,377 76 Updated Dec 28, 2025

os-for-agent / os-for-agent.github.io

AgentOS Workshop for ASPLOS'26

HTML 4 Updated Dec 16, 2025

microsoft / jbpf

Userspace eBPF instrumentation and control framework for deploying control and monitoring functions in a secure manner

C 69 18 Updated Aug 19, 2025

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 453 34 Updated May 30, 2025

pkusys / Rummy

GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.

C++ 38 9 Updated Mar 24, 2024

eunomia-bpf / gpu_ext

eBPF for GPU UVM offloading and scheduling in Linux kernel

C 11 Updated Dec 19, 2025

tgoodwin / monolift

monolith modernizer

Go 5 Updated Jul 30, 2025

sys-intelligence / system-intelligence-benchmark

System Intelligence Benchmark

TLA 37 5 Updated Dec 22, 2025

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 38,549 4,167 Updated Dec 26, 2025

GPUOpen-LibrariesAndSDKs / Orochi

C++ 246 38 Updated Nov 4, 2025

Arize-ai / phoenix

AI Observability & Evaluation

Jupyter Notebook 8,049 661 Updated Dec 27, 2025

eniac / paella

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

C++ 66 6 Updated May 1, 2024

cache-ext / cache_ext

cache_ext is a framework to customize Linux page cache eviction policies using BPF. Appeared in SOSP 2025.

Jupyter Notebook 61 12 Updated Dec 12, 2025

SJTU-IPADS / disb

DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.

C++ 57 7 Updated Aug 21, 2024

NVIDIA / gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,301 180 Updated Dec 17, 2025

thustorage / GPreempt

Jupyter Notebook 17 2 Updated May 18, 2025

google / kafel

A language and library for specifying syscall filtering policies.

C 341 53 Updated Nov 22, 2025

cvc5 / cvc5

cvc5 is an open-source automatic theorem prover for Satisfiability Modulo Theories (SMT) problems.

SMT 1,238 271 Updated Dec 22, 2025

OSU-STARLAB / UVM_benchmark

Roff 32 9 Updated Sep 9, 2020

aliyun / SimAI

C++ 776 127 Updated Oct 29, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 4,373 617 Updated Dec 28, 2025

NVIDIA / cuCollections

C++ 610 104 Updated Dec 20, 2025

agentclientprotocol / python-sdk

Python SDK for ACP clients and agents.

Python 107 7 Updated Dec 27, 2025

nnurlan008 / dream

This is the public repo for DREAM accepted to ICS 2025.

C++ 8 1 Updated Dec 28, 2025

pkusys / TGS

Artifacts for our NSDI'23 paper TGS

Python 95 28 Updated Jun 10, 2024

nicexlab / GeminiFS

GeminiFS: A Companion File System for GPUs

C++ 69 11 Updated Feb 18, 2025

RC4ML / Hyperion

Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]

Cuda 22 4 Updated Jan 6, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,012 162 Updated Dec 20, 2025

云微 yunwei37

Sponsors

Highlights

Organizations

Lists (1)

profiler

Starred repositories

Database

static-analysis

Git

Operating system

Rust

Docker

C

Android

C++

Compiler