BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment

Published in NAACL 2025