You can exclude page URLs by modifying your sitemap. however, I don’t think this is the correct approach.
You can include individual webpages using the one-time training dataset “webpage,” but if your website is constantly updating, you will lose the real-time synchronization that you get by using the sitemap Website dataset.