canon.be
robots.txt

Robots Exclusion Standard data for canon.be

Resource Scan

Scan Details

Site Domain canon.be
Base Domain canon.be
Scan Status Ok
Last Scan2024-09-24T12:34:04+00:00
Next Scan 2024-10-08T12:34:04+00:00

Last Scan

Scanned2024-09-24T12:34:04+00:00
URL https://canon.be/robots.txt
Redirect https://www.canon.be/robots.txt
Redirect Domain www.canon.be
Redirect Base canon.be
Domain IPs 20.113.157.133
Redirect IPs 23.32.29.90, 23.32.29.98
Response IP 23.32.29.98
Found Yes
Hash cf8e6b84164bb34bda9f6a6fba9b8eac688ee38b22cbdef6ef07dcd060d7ab7d
SimHash e800fa614511

Groups

*

Rule Path
Allow /

gsa-crawler

Rule Path
Allow /search/support/*

*

Rule Path
Disallow /Images/*.doc
Disallow /Images/*.xls
Disallow /Images/*.swf
Disallow /*?WT.srch=1
Disallow /*.axd$
Disallow /search/support/*

Comments

  • Sitemap: /sitemap.xml