Publications
2025
1.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng. Agent-SiMT: Agent-assisted Simultaneous Translation with Large Language Models.  IEEE Transactions on Audio, Speech and Language Processing (TASLP), 1-10, pdf

pdfpdf
2.

Langlin Huang, Mengyu Bu, Yang Feng. MoCE:Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation. 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics. (NAACL 2025), April 29–May 4, 2025,1011–1028, Albuquerque, New Mexico, pdf

pdfpdf
3.

Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng.  LLaMA-Omni: Seamless Speech Interaction with Large Language Models. The 13th International Conference on Learning Representations(ICLR 2025), April 24-28, 2025, Singapore, pdf

pdfpdf
4.

Shaolei Zhang, Qingkai Fang, Zhe Yang, Yang Feng. LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token. The 13th International Conference on Learning Representations(ICLR 2025), April 24-28, 2025, Singapore, pdf

pdfpdf
5.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Yang Feng. Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation. Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025), February 25-March 4, 2025, 23969-23977, Pennsylvania, America, pdf

pdfpdf