theherdnow.com
robots.txt

Robots Exclusion Standard data for theherdnow.com

Resource Scan

Scan Details

Site Domain theherdnow.com
Base Domain theherdnow.com
Scan Status Ok
Last Scan 2024-11-15T01:16:37+00:00
Next Scan 2024-11-22T01:16:37+00:00

Last Scan

Scanned 2024-11-15T01:16:37+00:00
URL https://www.theherdnow.com/robots.txt
Domain IPs 199.232.210.193, 199.232.214.193
Response IP 199.232.46.193
Found Yes
Hash d46f322a597e98ef02e8c8879af296e42b4c0cda3350bd8e21e12aee15109ad2
SimHash b3135cc4efd0

Groups

mediapartners-google*
*

Rule Path
Disallow /search/*
Disallow /calendar/ajaxcall/
Disallow /static/
Disallow /api/*
Disallow /_debug/*
Disallow /_preview
Disallow /eyes-to-ears/*
Disallow /newsletter/embed/*
Disallow /contact/send*
Disallow /text/*
Disallow /alternate/amp/stats.html
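
To see what the rules above block in practice, a candidate URL path can be tested against them. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the usual robots.txt matching conventions: rules match from the start of the path, `*` matches any run of characters, and a trailing `$` anchors the end. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers introduced for illustration. The report does not show which of the two listed groups each rule belongs to, so the sketch checks only the patterns themselves and ignores the user-agent.

```python
import re

# Disallow paths copied from the rule table above.
# Assumption: which user-agent group they belong to is not modeled here.
DISALLOW = [
    "/search/*",
    "/calendar/ajaxcall/",
    "/static/",
    "/api/*",
    "/_debug/*",
    "/_preview",
    "/eyes-to-ears/*",
    "/newsletter/embed/*",
    "/contact/send*",
    "/text/*",
    "/alternate/amp/stats.html",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Compile one rule path: '*' is a wildcard, a trailing '$' anchors the end."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any rule matches the path from its first character."""
    # re.match anchors at the start only, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/search/cattle", "/_preview/draft", "/news/article", "/static"]:
        print(candidate, is_disallowed(candidate))
```

Because the table contains no Allow rules, the sketch does not need longest-match precedence between Allow and Disallow; a rule set that mixes the two would require that extra step.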

Other Records

Field Value
sitemap https://www.theherdnow.com/sitemap.xml