Jiaming Tang is a visiting student at the MIT HAN Lab. He is an undergraduate in the ACM Honors Class at Shanghai Jiao Tong University. His research interests lie in efficient systems and algorithms for large language models.