Efficient AI Computing,
Transforming the Future.

Projects

To choose projects, simply check the boxes of the categories, topics and techniques.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

NAAS: Neural Accelerator Architecture Search

DAC 2021
 (
)

As a data-driven approach, NAAS holistically composes highly matched accelerator and neural architectures together with efficient compiler mapping.

Delayed Gradient Averaging: Tolerate the Communication Latency in Federated Learning

NeurIPS 2021
 (
)

We propose Delayed Gradient Averaging (DGA), which delays the averaging step to improve efficiency and allows local computation in parallel to communication.

MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning

NeurIPS 2021
 (
)

In MCUNetV2, we propose a generic patch-by-patch inference scheduling, which operates only on a small spatial region of the feature map and significantly cuts down the peak memory. We further propose network redistribution to shift the receptive field and FLOPs to the later stage and reduce the computation overhead.

PointAcc: Efficient Point Cloud Accelerator

MICRO 2021
 (
)

PointAcc is a novel point cloud deep learning accelerator. It introduces a configurable sorting-based mapping unit that efficiently supports diverse operations in point cloud networks. PointAcc further exploits simplified caching and layer fusion specialized for point cloud models, effectively reducing the DRAM access.