nclc.org
robots.txt
Robots Exclusion Standard data for nclc.org
Resource Scan
Scan Details
Site Domain | nclc.org |
Base Domain | nclc.org |
Scan Status | Ok |
Last Scan | 2025-09-16T06:08:16+00:00 |
Next Scan | 2025-10-16T06:08:16+00:00 |
Last Scan
Scanned | 2025-09-16T06:08:16+00:00 |
URL | https://nclc.org/robots.txt |
Redirect | https://www.nclc.org/robots.txt |
Redirect Domain | www.nclc.org |
Redirect Base | nclc.org |
Domain IPs | 104.26.12.18, 104.26.13.18, 172.67.72.66, 2606:4700:20::681a:c12, 2606:4700:20::681a:d12, 2606:4700:20::ac43:4842 |
Redirect IPs | 104.26.12.18, 104.26.13.18, 172.67.72.66, 2606:4700:20::681a:c12, 2606:4700:20::681a:d12, 2606:4700:20::ac43:4842 |
Response IP | 104.26.12.18 |
Found | Yes |
Hash | 04ea5388f0ecffd25317579b24f2af54b98b4aaddf858cb78bdd08d1fae6ac28 |
SimHash | 49004840e533 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp/wp-admin/ |
Allow | /wp/wp-admin/admin-ajax.php |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://www.nclc.org/sitemap_index.xml |
Comments