npca.org
robots.txt
Robots Exclusion Standard data for npca.org
Resource Scan
Scan Details
Site Domain | npca.org |
Base Domain | npca.org |
Scan Status | Ok |
Last Scan | 2024-10-04T02:48:50+00:00 |
Next Scan | 2024-11-03T02:48:50+00:00 |
Last Scan
Scanned | 2024-10-04T02:48:50+00:00 |
URL | https://npca.org/robots.txt |
Redirect | https://www.npca.org/robots.txt |
Redirect Domain | www.npca.org |
Redirect Base | npca.org |
Domain IPs | 89.106.200.1 |
Redirect IPs | 23.22.5.68, 3.226.182.14, 52.21.227.162, 54.237.159.171 |
Response IP | 52.21.227.162 |
Found | Yes |
Hash | 6649c2ba68c45fb0a2fd57b7e5a2977ecdc96f98f4a38a34d86b048dc51350b0 |
SimHash | e001c474cf95 |
Groups
*
Rule | Path |
---|---|
Disallow | /prototype |
Disallow | /prototype/* |
Disallow | /admin |
Disallow | /admin/* |
Disallow | /api |
Disallow | /api/* |
Disallow | /fpc |
Disallow | /fpc/* |
Disallow | /400 |
Disallow | /404 |
Disallow | /406 |
Disallow | /422 |
Disallow | /500 |
Disallow | /503 |
Disallow | /504 |
Other Records
Field | Value |
---|---|
sitemap | https://npca.s3.amazonaws.com/sitemap.xml |
sitemap | https://npca.s3.amazonaws.com/google-news-sitemap.xml |