cgccusa.org
robots.txt

Robots Exclusion Standard data for cgccusa.org

Resource Scan

Scan Details

Site Domain cgccusa.org
Base Domain cgccusa.org
Scan Status Ok
Last Scan2025-08-05T18:12:45+00:00
Next Scan 2025-09-04T18:12:45+00:00

Last Scan

Scanned2025-08-05T18:12:45+00:00
URL https://cgccusa.org/robots.txt
Redirect https://www.cgccusa.org/robots.txt
Redirect Domain www.cgccusa.org
Redirect Base cgccusa.org
Domain IPs 199.60.103.100, 199.60.103.200
Redirect IPs 199.60.103.228, 199.60.103.28, 2606:2c40::c73c:671c, 2606:2c40::c73c:67e4
Response IP 199.60.103.228
Found Yes
Hash ea8e85a2b5420fc4539c4997c85a6821500c4e4a4ebb410b0c926f648fddb5ce
SimHash 3861c561c5b1

Groups

*

Rule Path
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*