include.com
robots.txt

Robots Exclusion Standard data for include.com

Resource Scan

Scan Details

Site Domain include.com
Base Domain include.com
Scan Status Ok
Last Scan2024-09-08T11:46:59+00:00
Next Scan 2024-10-08T11:46:59+00:00

Last Scan

Scanned2024-09-08T11:46:59+00:00
URL https://www.include.com/robots.txt
Domain IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Response IP 199.60.103.30
Found Yes
Hash 2357f04cef001eabeef5ad2e2d1b8ce1fd6bd71fbae5f2a99d6a81c9370af013
SimHash 28eddfa9d592

Groups

*

Rule Path
Disallow /thank-you-sales-tools-tip-sheet
Disallow /thank-you-payroll-outsourcing-guide
Disallow /thank-you-operations-tip-sheet-0
Disallow /thank-you-reporting-tip-sheet
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

Warnings

  • `noindex` is not a known field.