gitedugardon.com
robots.txt

Robots Exclusion Standard data for gitedugardon.com

Resource Scan

Scan Details

Site Domain gitedugardon.com
Base Domain gitedugardon.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-05-29T14:47:25+00:00
Next Scan 2024-08-27T14:47:25+00:00

Last Successful Scan

Scanned 2022-10-27T21:46:55+00:00
URL http://www.gitedugardon.com/robots.txt
Redirect http://gitedugardon.canalblog.com/robots.txt
Redirect Domain gitedugardon.canalblog.com
Redirect Base canalblog.com
Response IP 195.137.184.101
Found Yes
Hash b23523ce42706efee8cb34c17b7ecdde3f23bf2f3a7e5ee7c2632879592baf0f
SimHash 6b05dc71c2b5

Groups

*

Rule Path
Disallow /cf/fe/remote/ffads.cfm

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

ahrefsbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

*

Rule Path
Disallow /cf/fe/remote/ffads.cfm

Other Records

Field Value
sitemap http://gitedugardon.canalblog.com/rss.xml
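
The groups and records listed above can be checked with a short script. The following is a minimal sketch, assuming a robots.txt reconstructed from this report (the original file's exact text, ordering, and casing are not shown here) and using Python's standard urllib.robotparser module. The agent name "examplebot" is hypothetical and stands in for any crawler that has no group of its own. Python's parser keeps only the first "*" group and matches agent names by substring, so real crawlers may read the same file differently.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the groups listed in this report; the real file's
# exact text, ordering, and casing are not available here.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow: /cf/fe/remote/ffads.cfm

User-agent: bingbot
Crawl-delay: 4

User-agent: msnbot
Crawl-delay: 4

User-agent: msnbot-media
Crawl-delay: 4

User-agent: semrushbot
Crawl-delay: 5

User-agent: ahrefsbot
Disallow: /

User-agent: seekportbot
Disallow: /

User-agent: *
Disallow: /cf/fe/remote/ffads.cfm

Sitemap: http://gitedugardon.canalblog.com/rss.xml
"""

# Host taken from the redirect target recorded in the last successful scan.
BASE = "http://gitedugardon.canalblog.com"


def build_parser(text: str = RECONSTRUCTED_ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()


def report() -> None:
    """Print how each agent is treated for the one explicitly listed path."""
    # "examplebot" is a hypothetical agent with no group of its own.
    for agent in ("bingbot", "semrushbot", "ahrefsbot", "examplebot"):
        allowed = parser.can_fetch(agent, BASE + "/cf/fe/remote/ffads.cfm")
        print(agent, "allowed:", allowed, "crawl-delay:", parser.crawl_delay(agent))
    print("sitemaps:", parser.site_maps())  # site_maps() needs Python 3.8+


if __name__ == "__main__":
    report()
```

Parsing from a string keeps the check independent of the server, which matters here because the most recent scan could not connect.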