newswire.com
robots.txt
Robots Exclusion Standard data for newswire.com
Resource Scan
Scan Details
Site Domain | newswire.com |
Base Domain | newswire.com |
Scan Status | Ok |
Last Scan | 2024-11-07T08:55:06+00:00 |
Next Scan | 2024-11-21T08:55:06+00:00 |
Last Scan
Scanned | 2024-11-07T08:55:06+00:00 |
URL | https://newswire.com/robots.txt |
Redirect | https://www.newswire.com/robots.txt |
Redirect Domain | www.newswire.com |
Redirect Base | newswire.com |
Domain IPs | 35.226.170.102, 35.238.119.212 |
Redirect IPs | 104.21.33.95, 172.67.189.128, 2606:4700:3030::ac43:bd80, 2606:4700:3031::6815:215f |
Response IP | 104.21.33.95 |
Found | Yes |
Hash | c0f8eb284b790b040eb058fc73c0bc8c0a03ee64f227650151d4fa4b2d82bd19 |
SimHash | 43bf524ac3d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /duda/ |
Disallow | /admin/ |
Disallow | /manage/ |
Disallow | /reseller/ |
Disallow | /order |
Disallow | /browse/tag/ |
Disallow | /newsroom/tag/ |
Disallow | /news-center/tag/ |
Disallow | /newsroom/rss/tag/ |
Disallow | /news-center/rss/tag/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://www.newswire.com/sitemap/sitemap-index.xml |
Comments