artetcuriosites.com
robots.txt

Robots Exclusion Standard data for artetcuriosites.com

Resource Scan

Scan Details

Site Domain artetcuriosites.com
Base Domain artetcuriosites.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-09-26T07:38:12+00:00
Next Scan 2024-12-25T07:38:12+00:00

Last Successful Scan

Scanned 2022-12-04T06:27:05+00:00
URL http://artetcuriosites.com/robots.txt
Redirect http://artetcuriosites.canalblog.com/robots.txt
Redirect Domain artetcuriosites.canalblog.com
Redirect Base canalblog.com
Domain IPs 195.137.184.101
Redirect IPs 104.18.24.250, 104.18.25.250
Response IP 104.18.24.250
Found Yes
Hash 5b52a6778e89a81081f9aceb0c9fe9d0b2f4f974d4bfae0cf4487344b0c5c2f2
SimHash 6b055c71c295

Groups

*

Rule Path
Disallow /cf/fe/remote/ffads.cfm

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 0.2

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

ahrefsbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

*

Rule Path
Disallow /cf/fe/remote/ffads.cfm

Other Records

Field Value
sitemap http://artetcuriosites.canalblog.com/rss.xml
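The groups listed above can be checked with Python's standard urllib.robotparser module. The sketch below is illustrative only: ROBOTS_TXT is a hypothetical reconstruction built from the groups in this report (the raw file text is not part of the scan data, so its ordering, casing, and spacing are assumptions), and build_parser and the agent name "examplebot" are names introduced for this example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the scanned file from the report's groups.
# The raw robots.txt is not included in the report, so the exact layout
# below is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cf/fe/remote/ffads.cfm

User-agent: bingbot
Crawl-delay: 4

User-agent: msnbot
Crawl-delay: 4

User-agent: msnbot-media
Crawl-delay: 4

User-agent: pinterestbot
Crawl-delay: 0.2

User-agent: semrushbot
Crawl-delay: 5

User-agent: ahrefsbot
Disallow: /

User-agent: seekportbot
Disallow: /

User-agent: *
Disallow: /cf/fe/remote/ffads.cfm

Sitemap: http://artetcuriosites.canalblog.com/rss.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    rp = build_parser()
    base = "http://artetcuriosites.canalblog.com"

    # ahrefsbot and seekportbot are disallowed everywhere.
    print("ahrefsbot /:", rp.can_fetch("ahrefsbot", base + "/"))

    # An agent without its own group falls back to the "*" group.
    print("examplebot ffads.cfm:",
          rp.can_fetch("examplebot", base + "/cf/fe/remote/ffads.cfm"))

    # bingbot has its own group with no rules, so the "*" rule
    # does not apply to it.
    print("bingbot ffads.cfm:",
          rp.can_fetch("bingbot", base + "/cf/fe/remote/ffads.cfm"))

    # Crawl-delay values for groups that declare an integer delay.
    print("bingbot delay:", rp.crawl_delay("bingbot"))
    print("semrushbot delay:", rp.crawl_delay("semrushbot"))
```

This mirrors the "No rules defined. All paths allowed." entries: a crawler that matches a named group uses only that group and does not inherit the Disallow rule from the * group. Parsers differ in how they treat non-integer crawl-delay values such as the 0.2 listed for pinterestbot, so that value may not be reported by every library.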