Reinforcement Learning from Human Feedback Bookmarks