Efficient AI Computing,
Transforming the Future.

Projects

To choose projects, simply check the boxes of the categories, topics and techniques.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction

ICCV 2023
 (
)

EfficientViT is a new family of vision models for high-resolution dense prediction. It achieves global receptive field and multi-scale learning with only hardware-efficient operations. EfficientViT delivers remarkable performance gains over previous models with speedup on diverse hardware platforms, including mobile CPU, edge GPU, and cloud GPU.

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

ICML 2023
 (
)

We propose SmoothQuant, a training-free, accuracy-preserving, and general-purpose post-training quantization (PTQ) solution to enable 8-bit weight, 8-bit activation (W8A8) quantization for LLMs.

FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

CVPR 2023
 (
)

We present FlatFormer, an efficient ViT architecture for large-scale point cloud analysis.