Publications (* indicates equal contribution)

2025

  1. ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Multilingual Contrastive Framework
    Hengyuan Zhang, Chenming Shang, Sizhe Wang, Dongdong Zhang, Yiyao Yu, Feng Yao, Renliang Sun, Yujiu Yang, Furu Wei
    ACL 2025 | arXiv
  2. BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment
    Sizhe Wang, Yongqi Tong, Hengyuan Zhang, Dawei Li, Xin Zhang, Tianlong Chen
    NAACL 2025 | arXiv

2024

  1. Optimizing Language Model's Reasoning Abilities with Weak Supervision
    Yongqi Tong*, Sizhe Wang*, Dawei Li, Yifan Wang, Simeng Han, Zi Lin, Chengsong Huang, Jiaxin Huang, Jingbo Shang
    Preprint | arXiv
  2. Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning
    Yongqi Tong, Dawei Li, Sizhe Wang, Yujia Wang, Fei Teng, Jingbo Shang
    ACL 2024 | arXiv

2023

  1. Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking
    Yongqi Tong, Yifan Wang, Dawei Li, Sizhe Wang, Zi Lin, Simeng Han, Jingbo Shang
    Preprint | arXiv