Publications
2025
1.

Shoutao Guo, Shaolei Zhang, Qingkai Fang, Zhengrui Ma, Min Zhang, Yang Feng. FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing. Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS 2025). December 2nd to December 7th, San Diego, US.

2.

Zhengrui Ma, Yang Feng, Chenze Shao, Fandong Meng, Jie Zhou, Min Zhang. Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space. Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS 2025). December 2nd to December 7th, San Diego, US.

3.

Kangyu Qiao, Shaolei Zhang, Yang Feng. IG-Pruning: Input-Guided Block Pruning for Large Language Models. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), November 4th to November 9th, 2025, Suzhou, China.

4.

Mengyu Bu, Shaolei Zhang, Zhongjun He, Hua Wu, Yang Feng. AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), November 4th to November 9th, 2025, Suzhou, China.

5.

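Shaolei Zhang, Yang Feng. A Real-time Speech Translation Method Based on a Connectionist Temporal Classification Decoder [J]. Chinese Journal of Computers, 2025, 0-16.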

6.

Shaolei Zhang, Yang Feng. A Survey of Research on Real-time Translation [J]. Journal of Chinese Information Processing, 2025, 0-21.

7.

Andong Chen, Kehai Chen, Yang Xiang, Xuefeng Bai, Muyun Yang, Yang Feng, Tiejun Zhao, Min Zhang. LLM-based Translation Inference with Iterative Bilingual Understanding. The 63rd Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025), July 27–August 1st, 2025, Vienna, Austria.

8.

Zhuocheng Zhang, Yang Feng, Min Zhang. FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation. The 63rd Annual Meeting of the Association for Computational Linguistics (System Demonstration of ACL 2025), July 27–August 1st, 2025, Vienna, Austria.

9.

Qingkai Fang, Yan Zhou, Shoutao Guo, Shaolei Zhang, Yang Feng. LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), July 27–August 1st, 2025, Vienna, Austria.

10.

Zhengrui Ma, Yang Feng, Min Zhang. Overcoming Non-monotonicity in Transducer-based Streaming Generation. The Forty-Second International Conference on Machine Learning (ICML 2025), July 13th to July 19th, 2025, Vancouver, BC, Canada.

11.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng. Agent-SiMT: Agent-assisted Simultaneous Translation with Large Language Models. IEEE Transactions on Audio, Speech and Language Processing (TASLP), 1-10.

12.

Shaolei Zhang, Shoutao Guo, Qingkai Fang, Yan Zhou, Yang Feng. Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model. arXiv, Jun 16, 2025.

13.

Zhengrui Ma, Yang Feng, Chenze Shao, Fandong Meng, Jie Zhou, Min Zhang. Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space. arXiv, May 19, 2025.

14.

Langlin Huang, Mengyu Bu, Yang Feng. MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation. 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025), April 29–May 4, 2025, 1011–1028, Albuquerque, New Mexico.

15.

Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng. LLaMA-Omni: Seamless Speech Interaction with Large Language Models. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore.

16.

Shaolei Zhang, Qingkai Fang, Zhe Yang, Yang Feng. LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore.

17.

Zhuocheng Zhang, Yang Feng, Min Zhang. LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers. arXiv, Feb 25, 2025.

18.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Yang Feng. Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation. Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025), February 25-March 4, 2025, 23969-23977, Philadelphia, Pennsylvania, USA.
