2025 - Institute of Computing Technology, Chinese Academy of Sciences - Natural Language Processing Research Group Website

1.

Shoutao Guo, Shaolei Zhang, Qingkai Fang, Zhengrui Ma, Min Zhang, Yang Feng. FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing. Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS 2025). December 2nd to December 7th, San Diego, US. pdf


2.

Zhengrui Ma, Yang Feng, Chenze Shao, Fandong Meng, Jie Zhou, Min Zhang. Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space. Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS 2025). December 2nd to December 7th, San Diego, US. pdf


3.

Kangyu Qiao, Shaolei Zhang, Yang Feng. IG-Pruning: Input-Guided Block Pruning for Large Language Models. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), November 4th to November 9th, 2025, Suzhou, China. pdf


4.

Mengyu Bu, Shaolei Zhang, Zhongjun He, Hua Wu, Yang Feng. AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), November 4th to November 9th, 2025, Suzhou, China. pdf


5.

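Shaolei Zhang, Yang Feng. A Real-Time Speech Translation Method Based on a Connectionist Temporal Classification Decoder [J]. Chinese Journal of Computers (计算机学报), 2025, 0-16. pdf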


6.

Shaolei Zhang, Yang Feng. A Survey of Real-Time Translation Research [J]. Journal of Chinese Information Processing (中文信息学报), 2025, 0-21. pdf


7.

Andong Chen, Kehai Chen, Yang Xiang, Xuefeng Bai, Muyun Yang, Yang Feng, Tiejun Zhao, Min Zhang. LLM-based Translation Inference with Iterative Bilingual Understanding. The 63rd Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf


8.

Zhuocheng Zhang, Yang Feng, Min Zhang. FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation. The 63rd Annual Meeting of the Association for Computational Linguistics (System Demonstration of ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf


9.

Qingkai Fang, Yan Zhou, Shoutao Guo, Shaolei Zhang, Yang Feng. LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), July 27–August 1st, 2025, Vienna, Austria, pdf


10.

Zhengrui Ma, Yang Feng, Min Zhang. Overcoming Non-monotonicity in Transducer-based Streaming Generation. The Forty-Second International Conference on Machine Learning (ICML 2025), July 13th-July 19th, 2025, Vancouver, BC, Canada, pdf


11.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng. Agent-SiMT: Agent-assisted Simultaneous Translation with Large Language Models. IEEE Transactions on Audio, Speech and Language Processing (TASLP), 1-10, pdf


12.

Shaolei Zhang, Shoutao Guo, Qingkai Fang, Yan Zhou, Yang Feng. Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model. arXiv, Jun 16, 2025, pdf


13.

Langlin Huang, Mengyu Bu, Yang Feng. MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation. 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025), April 29–May 4, 2025, 1011–1028, Albuquerque, New Mexico, pdf


14.

Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng. LLaMA-Omni: Seamless Speech Interaction with Large Language Models. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore, pdf


15.

Shaolei Zhang, Qingkai Fang, Zhe Yang, Yang Feng. LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token. The 13th International Conference on Learning Representations (ICLR 2025), April 24-28, 2025, Singapore, pdf


16.

Zhuocheng Zhang, Yang Feng, Min Zhang. LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers. arXiv, Feb 25, 2025, pdf


17.

Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Yang Feng. Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation. Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025), February 25-March 4, 2025, 23969-23977, Philadelphia, Pennsylvania, USA, pdf
