bleachernation.com
robots.txt

Robots Exclusion Standard data for bleachernation.com

Resource Scan

Scan Details

Site Domain bleachernation.com
Base Domain bleachernation.com
Scan Status Ok
Last Scan2024-11-03T16:57:22+00:00
Next Scan 2024-11-10T16:57:22+00:00

Last Scan

Scanned2024-11-03T16:57:22+00:00
URL https://bleachernation.com/robots.txt
Domain IPs 104.26.2.234, 104.26.3.234, 172.67.72.19, 2606:4700:20::681a:2ea, 2606:4700:20::681a:3ea, 2606:4700:20::ac43:4813
Response IP 104.26.2.234
Found Yes
Hash 60e06c541993e5780b9a08cd97ab6fda0994e37dda474646426455bcb7d3cfd8
SimHash 7f015cb0ac92

Groups

*

Rule Path Comment
Disallow /wp-admin/ block access to admin section
Allow /wp-admin/admin-ajax.php -
Disallow /wp-login.php block access to admin section variant two
Disallow *?s=* block access to internal search result pages
Disallow *?s&cat=* block access to internal search result pages variant two
Disallow *?_page=* block access to post pagination
Disallow *%26preview%3D* block access to preview pages
Disallow *?utm_campaign=* block access to UTM tracking
Disallow /page/*?*utm_source= -
Disallow /go/ -

facebookexternalhit

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.bleachernation.com/sitemap_index.xml

Comments

  • Block access to non-indexable URLs
  • Block Sportsbook affiliate links from being crawled
  • Allow Facebook Crawler