Browsing: Reinforcement Learning from Human Feedback