Robots Exclusion Standard data for nwanews.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | nwanews.com |
Base Domain | nwanews.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-10-27T16:13:56+00:00 |
Next Scan | 2025-01-25T16:13:56+00:00 |
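
The gap between the Last Scan and Next Scan timestamps can be checked with a short calculation. This is only a sketch based on the two values reported above; the variable names are illustrative, and the result describes this one record, not necessarily the scanner's general rescan policy.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-10-27T16:13:56+00:00")
next_scan = datetime.fromisoformat("2025-01-25T16:13:56+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(f"Days until the next scheduled scan: {interval.days}")
```

For this record the difference works out to 90 days.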
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-04-01T16:12:23+00:00 |
URL | http://nwanews.com/robots.txt |
Redirect | https://www.nwaonline.com/robots.txt |
Redirect Domain | www.nwaonline.com |
Redirect Base | nwaonline.com |
Domain IPs | 208.91.60.191 |
Redirect IPs | 104.26.14.18, 104.26.15.18, 172.67.71.53, 2606:4700:20::681a:e12, 2606:4700:20::681a:f12, 2606:4700:20::ac43:4735 |
Response IP | 172.67.71.53 |
Found | Yes |
Hash | 863672ac5fe4b5bb66dd204735aaf3cc2a48bccafc837a552d0a28e4f43e0042 |
SimHash | 679f736a6fc1 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /assets/ |
Disallow | /blaize/datalayer/ |
Disallow | /cgi-bin/ |
Disallow | /content/right2know/salaries/ |
Disallow | /content/right2know/salaries/search/ |
Disallow | /plugins/public/treasure-data-cdp/user-profile/ |
Disallow | /puzzles/ |
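
To see how these rules would apply to specific paths, the group can be rebuilt as robots.txt text and checked with Python's standard `urllib.robotparser`. This is a minimal sketch under stated assumptions: `ROBOTS_TXT` is a reconstruction from the table above, not the file as served (the original also contained one invalid line that is not shown here), and `is_allowed` and the `ExampleBot` user agent are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table for the "*" group (an assumption:
# the served file may differ in ordering, comments, or extra lines).
ROBOTS_TXT = """\
User-agent: *
Disallow: /assets/
Disallow: /blaize/datalayer/
Disallow: /cgi-bin/
Disallow: /content/right2know/salaries/
Disallow: /content/right2know/salaries/search/
Disallow: /plugins/public/treasure-data-cdp/user-profile/
Disallow: /puzzles/
"""


def is_allowed(path: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the reconstructed rules permit fetching `path`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for path in ("/news/", "/puzzles/crossword", "/puzzles", "/cgi-bin/test"):
        verdict = "allowed" if is_allowed(path) else "disallowed"
        print(f"{path}: {verdict}")
```

The standard-library parser matches rules by simple path prefix, so `/puzzles` without the trailing slash is not covered by `Disallow: /puzzles/`. The same prefix matching means `/content/right2know/salaries/search/` is already covered by the broader `/content/right2know/salaries/` rule.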
Warnings
- 1 invalid line.