Hey, if you're reading this and happen to be one of the guys training frontier LLMs, please penalize 404 URLs in your reward functions. Too often these models memorize or make up nonexistent URL paths and get away with it.
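For what it's worth, here is a rough sketch of what such a penalty term could look like. This is not how any lab actually does it: the names `extract_urls`, `http_status`, and `dead_link_penalty`, the URL regex, and the flat penalty of 1.0 per dead link are all assumptions made up for illustration. It pulls URLs out of a model response, checks each one with a HEAD request, and subtracts a fixed amount for every 404.

```python
import http.client
import re
import urllib.error
import urllib.request
from typing import Callable, List, Optional

# Hypothetical, deliberately simple URL matcher; real responses may need something stricter.
URL_PATTERN = re.compile(r"https?://[^\s<>)\]]+")


def extract_urls(text: str) -> List[str]:
    """Return the unique URLs found in text, in order of first appearance."""
    urls: List[str] = []
    for match in URL_PATTERN.findall(text):
        url = match.rstrip(".,;:!?\"'")  # drop trailing sentence punctuation
        if url not in urls:
            urls.append(url)
    return urls


def http_status(url: str, timeout: float = 5.0) -> Optional[int]:
    """Send a HEAD request and return the HTTP status code, or None if unreachable."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses arrive as exceptions
    except (OSError, ValueError, http.client.HTTPException):
        return None  # DNS failure, timeout, malformed URL, broken response


def dead_link_penalty(
    text: str,
    get_status: Callable[[str], Optional[int]] = http_status,
    penalty_per_url: float = 1.0,  # assumed value, would need tuning
) -> float:
    """Return a non-positive reward term: minus penalty_per_url for each URL that 404s."""
    dead = [url for url in extract_urls(text) if get_status(url) == 404]
    return 0.0 - penalty_per_url * len(dead)


if __name__ == "__main__":
    # Offline demo with a fake status lookup instead of real network calls.
    fake_statuses = {
        "https://example.com/docs": 200,
        "https://example.com/missing": 404,
    }
    response_text = "See https://example.com/docs and https://example.com/missing."
    print(dead_link_penalty(response_text, get_status=fake_statuses.get))
```

Only an explicit 404 is penalized here. Timeouts, servers that reject HEAD, and other errors are left alone, since those don't prove the path was made up. The status lookup is passed in as a parameter so that results could be cached or mocked rather than hitting the network on every rollout.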