thepapcorps.org
robots.txt
Robots Exclusion Standard data for thepapcorps.org
Resource Scan
Scan Details
Site Domain | thepapcorps.org |
Base Domain | thepapcorps.org |
Scan Status | Ok |
Last Scan | 2025-08-31T19:32:02+00:00 |
Next Scan | 2025-09-30T19:32:02+00:00 |
Last Scan
Scanned | 2025-08-31T19:32:02+00:00 |
URL | https://thepapcorps.org/robots.txt |
Redirect | https://www.thepapcorps.org/robots.txt |
Redirect Domain | www.thepapcorps.org |
Redirect Base | thepapcorps.org |
Domain IPs | 104.21.89.30, 172.67.136.144, 2606:4700:3030::6815:591e, 2606:4700:3032::ac43:8890 |
Redirect IPs | 104.21.89.30, 172.67.136.144, 2606:4700:3030::6815:591e, 2606:4700:3032::ac43:8890 |
Response IP | 104.21.89.30 |
Found | Yes |
Hash | f9131e389416106177996f04d364c7b7acf9ee49dce76e1baa2957f6be6775eb |
SimHash | c94188407d93 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wc-logs/ |
Disallow | /wp-content/uploads/woocommerce_transient_files/ |
Disallow | /wp-content/uploads/woocommerce_uploads/ |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wpo/wpo-plugins-tables-list.json |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://www.thepapcorps.org/sitemap_index.xml |
Comments