chaossearch.io
robots.txt

Robots Exclusion Standard data for chaossearch.io

Resource Scan

Scan Details

Site Domain chaossearch.io
Base Domain chaossearch.io
Scan Status Ok
Last Scan 2024-10-20T13:55:52+00:00
Next Scan 2024-11-19T13:55:52+00:00

Last Scan

Scanned 2024-10-20T13:55:52+00:00
URL https://chaossearch.io/robots.txt
Redirect https://www.chaossearch.io/robots.txt
Redirect Domain www.chaossearch.io
Redirect Base chaossearch.io
Domain IPs 13.33.88.106, 13.33.88.117, 13.33.88.24, 13.33.88.51
Redirect IPs 199.60.103.2, 199.60.103.254, 2606:2c40::c73c:6702, 2606:2c40::c73c:67fe
Response IP 199.60.103.2
Found Yes
Hash 8ad7c33d1b4445d689f2ee39ad5da88b2634a84f78b0996bab1af35c9dbf0f0e
SimHash 7a64ce38cdb3

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /hubfs
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
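
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how these Disallow rules could be checked against a URL path, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and rules otherwise match as prefixes. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch skips percent-encoding normalization and Allow precedence (this group has no Allow rules).

```python
import re

# Disallow rules copied from the "*" group listed above.
DISALLOW_RULES = [
    "/sample-*",
    "/blog/sample-*",
    "/hubfs",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]


def rule_to_regex(rule):
    """Translate one robots.txt path rule into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path, rules=DISALLOW_RULES):
    """Return True if any rule matches the path (with query) from its start."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/sample-data", "/hubfs/file.pdf", "/pricing",
                      "/page?hs_preview=abc"]:
        print(candidate, is_disallowed(candidate))
```

Because `re.match` only anchors at the start, a rule such as `/hubfs` acts as a prefix match and also covers `/hubfs/file.pdf`, which mirrors how path rules without wildcards are usually treated.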

Other Records

Field Value
sitemap https://www.chaossearch.io/sitemap.xml