cgspectrum.com
robots.txt

Robots Exclusion Standard data for cgspectrum.com

Resource Scan

Scan Details

Site Domain cgspectrum.com
Base Domain cgspectrum.com
Scan Status Ok
Last Scan2025-11-14T19:04:03+00:00
Next Scan 2025-12-14T19:04:03+00:00

Last Scan

Scanned2025-11-14T19:04:03+00:00
URL https://cgspectrum.com/robots.txt
Redirect https://www.cgspectrum.com/robots.txt
Redirect Domain www.cgspectrum.com
Redirect Base cgspectrum.com
Domain IPs 104.21.46.254, 172.67.143.135, 2606:4700:3030::6815:2efe, 2606:4700:3030::ac43:8f87
Redirect IPs 199.60.103.228, 199.60.103.28, 2606:2c40::c73c:671c, 2606:2c40::c73c:67e4
Response IP 199.60.103.228
Found Yes
Hash b8b6047c19a3007d809cee85b629fef9d0ecf90a4976a01db0942e7bffd5d2ec
SimHash 2c54dc75c691

Groups

*

Rule Path
Allow /
Allow /_hcms/*.js*
Allow /_hcms/*.css*
Allow /_hcms/*.png*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /_hcms/perf
Disallow /blog-redirect
Disallow /dev-blog-0129521851059128510293101
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

Other Records

Field Value
sitemap https://www.cgspectrum.com/sitemap.xml

Warnings

  • 2 invalid lines.