This is the official repository for paper RM-Distiller: Exploiting Generative LLM for Reward Model Distillation. In this paper, we introduce RM-Distiller, a framework designed to distill ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results