newstateschools.wpcomstaging.com
robots.txt

Robots Exclusion Standard data for newstateschools.wpcomstaging.com

Resource Scan

Scan Details

Site Domain newstateschools.wpcomstaging.com
Base Domain wpcomstaging.com
Scan Status Ok
Last Scan2025-11-22T07:28:00+00:00
Next Scan 2025-12-22T07:28:00+00:00

Last Scan

Scanned2025-11-22T07:28:00+00:00
URL https://newstateschools.wpcomstaging.com/robots.txt
Redirect https://newstateschools.org/robots.txt
Redirect Domain newstateschools.org
Redirect Base newstateschools.org
Domain IPs 192.0.78.20
Redirect IPs 192.0.78.180, 192.0.78.211
Response IP 192.0.78.211
Found Yes
Hash a65c9ca448420518327c1ee371d9766acd2074d9abb918fdc9d88f4d0852a79a
SimHash ebc080a2cfb3

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /*?add-to-cart=
Disallow /*?*add-to-cart=
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://newstateschools.org/sitemap.xml
sitemap https://newstateschools.org/news-sitemap.xml
sitemap https://newstateschools.org/sitemap_index.xml