Modal Labs
Modal makes it easy to run code in the cloud.
Repositories
- flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
- tokio-par-util Public
A utility crate for scheduling Rust streams/futures in parallel on Tokio according to structured parallelism best practices
- multinode-training-guide Public
Well-documented examples of running distributed training jobs on Modal
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs