Pinned

  1. streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python | 6.9k stars | 386 forks

  2. llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python | 3.1k stars | 260 forks

  3. efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python | 3k stars | 228 forks

  4. bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python | 2.7k stars | 481 forks

  5. temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python | 2.1k stars | 419 forks

  6. once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python | 1.9k stars | 343 forks

Repositories

Showing 10 of 62 repositories
  • radial-attention Public

    Radial Attention Official Implementation

    Python | 254 stars | Apache-2.0 | 10 forks | 6 open issues | 0 open pull requests | Updated Jul 3, 2025
  • lpd Public

    Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

    Python | 24 stars | MIT | 1 fork | 0 open issues | 0 open pull requests | Updated Jul 3, 2025
  • nunchaku Public

    [ICLR 2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

    Python | 2,238 stars | Apache-2.0 | 116 forks | 64 open issues | 8 open pull requests | Updated Jul 2, 2025
  • ComfyUI-nunchaku Public

    ComfyUI plugin for Nunchaku

    Python | 1,503 stars | Apache-2.0 | 44 forks | 59 open issues | 3 open pull requests | Updated Jun 29, 2025
  • x-attention Public

    XAttention: Block Sparse Attention with Antidiagonal Scoring

    Python | 170 stars | 10 forks | 2 open issues | 0 open pull requests | Updated Jun 28, 2025
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    Jupyter Notebook | 1,501 stars | MIT | 224 forks | 61 open issues (4 issues need help) | 8 open pull requests | Updated Jun 28, 2025
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python | 3,123 stars | MIT | 260 forks | 159 open issues | 13 open pull requests | Updated Jun 12, 2025
  • vila-u Public

    [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

    Python | 357 stars | MIT | 12 forks | 20 open issues | 0 open pull requests | Updated Apr 25, 2025
  • efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python | 2,953 stars | Apache-2.0 | 227 forks | 107 open issues | 0 open pull requests | Updated Apr 24, 2025
  • deepcompressor Public

    Model Compression Toolbox for Large Language Models and Diffusion Models

    Python | 518 stars | Apache-2.0 | 39 forks | 55 open issues | 1 open pull request | Updated Mar 28, 2025