New Book Details Reinforcement Learning from Human Feedback for LLM Alignment · Digg