Publications
2025
1.

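Shaolei Zhang, Yang Feng. A Simultaneous Speech Translation Method Based on a Connectionist Temporal Classification Decoder [J]. Chinese Journal of Computers (计算机学报), 2025, 0-16. pdf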

2.

Shaolei Zhang, Yang Feng. A Survey of Simultaneous Translation Research [J]. Journal of Chinese Information Processing (中文信息学报), 2025, 0-21.

3.

Andong Chen, Kehai Chen, Yang Xiang, Xuefeng Bai, Muyun Yang, Yang Feng, Tiejun Zhao, Min Zhang. LLM-based Translation Inference with Iterative Bilingual Understanding. The 63rd Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf

4.

Zhuocheng Zhang, Yang Feng, Min Zhang. FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation. The 63rd Annual Meeting of the Association for Computational Linguistics (System Demonstration of ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf

5.

Qingkai Fang, Yan Zhou, Shoutao Guo, Shaolei Zhang, Yang Feng. LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf

6.

Zhengrui Ma, Yang Feng, Min Zhang. Overcoming Non-monotonicity in Transducer-based Streaming Generation. The Forty-Second International Conference on Machine Learning (ICML 2025), July 13th–July 19th, 2025, Vancouver, BC, Canada, pdf

7.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng. Agent-SiMT: Agent-assisted Simultaneous Translation with Large Language Models. IEEE Transactions on Audio, Speech and Language Processing (TASLP), 1-10, pdf

8.

Shaolei Zhang, Shoutao Guo, Qingkai Fang, Yan Zhou, Yang Feng. Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model. arXiv, Jun 16, 2025, pdf

9.

Zhengrui Ma, Yang Feng, Chenze Shao, Fandong Meng, Jie Zhou, Min Zhang. Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space. arXiv, May 19, 2025, pdf

10.

Langlin Huang, Mengyu Bu, Yang Feng. MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation. 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025), April 29–May 4, 2025, 1011–1028, Albuquerque, New Mexico, pdf

11.

Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng. LLaMA-Omni: Seamless Speech Interaction with Large Language Models. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore, pdf

12.

Shaolei Zhang, Qingkai Fang, Zhe Yang, Yang Feng. LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore, pdf

13.

Zhuocheng Zhang, Yang Feng, Min Zhang. LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers. arXiv, Feb 25, 2025, pdf

14.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Yang Feng. Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation. Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025), February 25-March 4, 2025, 23969-23977, Pennsylvania, USA, pdf
