inoc.com
robots.txt

Robots Exclusion Standard data for inoc.com

Resource Scan

Scan Details

Site Domain inoc.com
Base Domain inoc.com
Scan Status Ok
Last Scan2024-11-07T03:49:35+00:00
Next Scan 2024-12-07T03:49:35+00:00

Last Scan

Scanned2024-11-07T03:49:35+00:00
URL https://inoc.com/robots.txt
Redirect https://www.inoc.com/robots.txt
Redirect Domain www.inoc.com
Redirect Base inoc.com
Domain IPs 100.21.224.1
Redirect IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Response IP 199.60.103.226
Found Yes
Hash 2ddd5068cf180efdf38d8a238dac17a1d9706cdbcb948651ce2b6a1e4da6eb31
SimHash 7075de30c5b3

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

gptbot

Rule Path
Disallow /
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*