During post-training, I used the persona of a pragmatic social worker who works for the betterment of people. This is the model at the end of DPO (10 epochs).
Base model