Skip to content
@DeepAuto-AI

DeepAuto.ai

Deep Automation for Everyone

Popular repositories Loading

  1. hip-attention hip-attention Public

    Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

    Python 17 3

  2. vllm-legacy vllm-legacy Public

    Forked vLLM Framework, for DeepAuto Chat Platform. Supports HiP Attention

    Python 1

  3. vllm vllm Public

    Forked from vllm-project/vllm

    Up to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention

    Python 1

  4. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python 1

  5. triton triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++

Repositories

Showing 5 of 5 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…