In MCUNetV3, we enable on-device training under 256KB SRAM and 1MB Flash through system-algorithm co-design, using less than 1/1000 of the memory of PyTorch while matching the accuracy on the visual wake words application. This lets the model adapt to newly collected sensor data, so users can enjoy customized services without uploading data to the cloud, protecting privacy.
Network augmentation (NetAug) is a training technique for tiny neural networks. It embeds the tiny model into larger models as a sub-network so that it receives extra supervision during training. NetAug consistently improves the performance of tiny models, achieving up to 2.2% accuracy improvement on ImageNet.
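Below is a toy, single-layer PyTorch sketch of this idea (hypothetical dimensions and names; the paper augments the width of convolutional networks): the tiny weight matrix is reused as a sub-matrix of a wider augmented layer, so it receives gradients from both the tiny model's loss and the augmented model's loss.

```python
# Toy single-layer sketch of the NetAug idea (hypothetical names and sizes).
# The tiny weight W_t is reused as a sub-matrix of a wider augmented layer,
# so W_t receives gradients from both the tiny loss and the augmented loss.
import torch
import torch.nn.functional as F
from torch import nn

in_dim, tiny_out, extra_out, num_classes = 32, 16, 16, 10
alpha = 1.0  # weight of the auxiliary loss from the augmented model

W_t = nn.Parameter(torch.randn(tiny_out, in_dim) * 0.01)       # tiny model weight (deployed)
W_extra = nn.Parameter(torch.randn(extra_out, in_dim) * 0.01)  # augmentation-only weight (discarded)
head_tiny = nn.Linear(tiny_out, num_classes)
head_aug = nn.Linear(tiny_out + extra_out, num_classes)

opt = torch.optim.SGD(
    [W_t, W_extra, *head_tiny.parameters(), *head_aug.parameters()], lr=0.1
)

x = torch.randn(8, in_dim)
y = torch.randint(0, num_classes, (8,))

feat_tiny = F.relu(x @ W_t.t())                              # tiny model forward
feat_aug = F.relu(x @ torch.cat([W_t, W_extra], dim=0).t())  # augmented model embeds W_t

loss = F.cross_entropy(head_tiny(feat_tiny), y) \
     + alpha * F.cross_entropy(head_aug(feat_aug), y)        # extra guidance for W_t
opt.zero_grad()
loss.backward()
opt.step()  # only W_t and its head are kept at deployment time
```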
We propose Delayed Gradient Averaging (DGA), which delays the gradient averaging step to improve efficiency, allowing local computation to proceed in parallel with communication.
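The update rule can be illustrated with a small single-process simulation (a sketch under toy assumptions, not the paper's distributed implementation): each worker applies its own gradient immediately and, D steps later, swaps that stale local gradient for the delayed global average of the same step.

```python
# Single-process toy simulation of Delayed Gradient Averaging on a quadratic
# objective (hypothetical setup; real DGA runs across machines and overlaps
# the all-reduce with computation). Each worker applies its local gradient
# immediately; D steps later, the stale local gradient is swapped for the
# global average of that same step.
from collections import deque

import numpy as np

num_workers, delay, lr, steps = 4, 3, 0.1, 200
targets = np.array([1.0, 2.0, 3.0, 4.0])   # each worker's local optimum (its data shard)
w = np.zeros(num_workers)                  # per-worker model replicas
in_flight = deque()                        # gradients still "in transit"

for t in range(steps):
    g = w - targets                        # local gradients dL_i/dw_i
    w -= lr * g                            # local update, no waiting for communication
    in_flight.append(g.copy())             # start the (delayed) all-reduce
    if len(in_flight) > delay:             # averaged gradient from step t - delay arrives
        g_old = in_flight.popleft()
        w += lr * (g_old - g_old.mean())   # replace the stale local grad with the average

print(w, w.mean())  # the replica mean converges to mean(targets) = 2.5;
                    # individual replicas stay within a small bounded offset
```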
Differentiable Augmentation (DiffAugment) improves the data efficiency of GAN training by applying differentiable augmentations to both real and generated samples for both the generator and discriminator updates.
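A minimal PyTorch sketch of the training loop under toy assumptions (hypothetical generator, discriminator, and a single brightness transform) is shown below; the key point is that the augmentation is applied to both real and fake samples and stays differentiable, so the generator update can see it.

```python
# Minimal sketch of DiffAugment-style GAN training in PyTorch (toy generator,
# discriminator, and a single brightness augmentation, all hypothetical).
# The same random, differentiable transform is applied to real and fake
# images for the D update and to fake images for the G update, so gradients
# flow through the augmentation back into the generator.
import torch
import torch.nn.functional as F
from torch import nn

def diff_augment(x):
    # Random brightness shift: differentiable w.r.t. the pixel values.
    # The paper also uses color, translation, and cutout transforms.
    return x + (torch.rand(x.size(0), 1, 1, 1, device=x.device) - 0.5)

G = nn.Sequential(nn.Linear(16, 3 * 8 * 8), nn.Tanh(), nn.Unflatten(1, (3, 8, 8)))
D = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)

real = torch.rand(8, 3, 8, 8)
z = torch.randn(8, 16)

# Discriminator step: augment BOTH real and generated samples.
d_loss = F.softplus(-D(diff_augment(real))).mean() + \
         F.softplus(D(diff_augment(G(z).detach()))).mean()
opt_d.zero_grad()
d_loss.backward()
opt_d.step()

# Generator step: the augmentation stays in the graph, so the gradient
# reaches G through T(G(z)).
g_loss = F.softplus(-D(diff_augment(G(z)))).mean()
opt_g.zero_grad()
g_loss.backward()
opt_g.step()
```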
Tiny-Transfer-Learning (TinyTL) provides memory-efficient on-device learning by freezing the weights and learning only the bias modules, which removes the need to store the intermediate activations, and by introducing lite residual modules to maintain the adaptation capacity.
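A minimal PyTorch sketch of this fine-tuning setup, assuming a stock MobileNetV2 backbone in place of the paper's and omitting the lite residual modules, looks as follows.

```python
# Minimal PyTorch sketch of the TinyTL fine-tuning setup (a stock MobileNetV2
# stands in for the paper's backbone; the lite residual modules are omitted).
# Bias gradients do not mathematically require the layer inputs, which is why
# freezing the weights lets TinyTL avoid storing intermediate activations
# (stock autograd may not realize the full saving automatically).
import torch
import torch.nn.functional as F
from torchvision import models

model = models.mobilenet_v2(num_classes=10)

for name, p in model.named_parameters():
    # Train only the bias terms and the classifier head; freeze every weight.
    p.requires_grad = name.endswith(".bias") or name.startswith("classifier")

trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.SGD(trainable, lr=0.01, momentum=0.9)

x = torch.randn(4, 3, 224, 224)
y = torch.randint(0, 10, (4,))
loss = F.cross_entropy(model(x), y)
loss.backward()
optimizer.step()
```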
Deep Gradient Compression (DGC) reduces the communication bandwidth in large-scale distributed training by exchanging only the important gradients and accumulating the rest locally; to preserve accuracy under this compression, it employs four techniques: momentum correction, local gradient clipping, momentum factor masking, and warm-up training.
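The bandwidth saving can be sketched with a single-tensor top-k compressor (a hedged toy version; the accuracy-preserving techniques above are omitted).

```python
# Single-tensor sketch of DGC-style gradient sparsification (hypothetical
# sparsity ratio; momentum correction, clipping, masking, and warm-up are
# omitted). Only the largest gradient entries are exchanged; the rest are
# accumulated locally and sent once they grow large enough.
import torch

sparsity = 0.999  # exchange only the top 0.1% of gradient entries

def dgc_compress(grad, residual):
    acc = residual + grad                   # local gradient accumulation
    k = max(1, int(acc.numel() * (1 - sparsity)))
    threshold = acc.abs().flatten().kthvalue(acc.numel() - k + 1).values
    mask = acc.abs() >= threshold           # keep only the largest entries
    sparse_grad = acc * mask                # what gets all-reduced (values + indices in practice)
    new_residual = acc * ~mask              # everything else stays local
    return sparse_grad, new_residual

residual = torch.zeros(1000)
for step in range(5):
    grad = torch.randn(1000) * 1e-3         # stand-in for a real back-propagated gradient
    sparse_grad, residual = dgc_compress(grad, residual)
    # dist.all_reduce(sparse_grad) would go here in real distributed training
```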