Robots Exclusion Standard data for canon.ch

Resource Scan

Scan Details

Site Domain canon.ch
Base Domain canon.ch
Scan Status Ok
Last Scan 2024-09-25T12:41:47+00:00
Next Scan 2024-10-09T12:41:47+00:00

Last Scan

Scanned 2024-09-25T12:41:47+00:00
URL https://canon.ch/robots.txt
Redirect https://www.canon.ch/robots.txt
Redirect Domain www.canon.ch
Redirect Base canon.ch
Domain IPs 20.113.157.133
Redirect IPs 23.32.29.11, 96.17.180.49
Response IP 23.52.40.73
Found Yes
Hash cf8e6b84164bb34bda9f6a6fba9b8eac688ee38b22cbdef6ef07dcd060d7ab7d
SimHash e800fa614511
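
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The scan does not state which algorithm it uses, so the following is only a sketch that assumes SHA-256 over the raw response body. The function name `content_hash` and the sample body are hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scanner's actual
    algorithm is not documented in this record.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real canon.ch file.
sample = b"User-agent: *\nAllow: /\n"
digest = content_hash(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing the digest of a freshly fetched file with a stored one is a cheap way to detect that the file changed between scans.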

Groups

*

Rule Path
Allow /

gsa-crawler

Rule Path
Allow /search/support/*

*

Rule Path
Disallow /Images/*.doc
Disallow /Images/*.xls
Disallow /Images/*.swf
Disallow /*?WT.srch=1
Disallow /*.axd$
Disallow /search/support/*
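
The rules listed above can be checked against a URL path with a small matcher. This is a minimal sketch, assuming the matching conventions of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. It also assumes the two `*` groups are merged into one rule set and that gsa-crawler uses only its own group. The names `STAR_RULES`, `GSA_RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules transcribed from the groups above; both "*" groups are merged (assumption).
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/Images/*.doc"),
    ("disallow", "/Images/*.xls"),
    ("disallow", "/Images/*.swf"),
    ("disallow", "/*?WT.srch=1"),
    ("disallow", "/*.axd$"),
    ("disallow", "/search/support/*"),
]
GSA_RULES = [("allow", "/search/support/*")]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


print(is_allowed(STAR_RULES, "/Images/manual.doc"))   # False
print(is_allowed(STAR_RULES, "/products/camera"))     # True
print(is_allowed(GSA_RULES, "/search/support/faq"))   # True
```

Under these assumptions, a generic crawler is kept out of `/search/support/`, while gsa-crawler is explicitly allowed into it.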

Comments

  • Sitemap: /sitemap.xml