artetcuriosites.com
robots.txt

Robots Exclusion Standard data for artetcuriosites.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	artetcuriosites.com
Base Domain	artetcuriosites.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-09-26T07:38:12+00:00
Next Scan	2024-12-25T07:38:12+00:00

Last Successful Scan

Scanned	2022-12-04T06:27:05+00:00
URL	http://artetcuriosites.com/robots.txt
Redirect	http://artetcuriosites.canalblog.com/robots.txt
Redirect Domain	artetcuriosites.canalblog.com
Redirect Base	canalblog.com
Domain IPs	195.137.184.101
Redirect IPs	104.18.24.250, 104.18.25.250
Response IP	104.18.24.250
Found	Yes
Hash	5b52a6778e89a81081f9aceb0c9fe9d0b2f4f974d4bfae0cf4487344b0c5c2f2
SimHash	6b055c71c295

Groups

*

Rule	Path
Disallow	/cf/fe/remote/ffads.cfm

Rule

Path

Disallow

/cf/fe/remote/ffads.cfm

bingbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	4

Field

Value

crawl-delay

4

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	4

Field

Value

crawl-delay

4

msnbot-media

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	4

Field

Value

crawl-delay

4

pinterestbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	0.2

Field

Value

crawl-delay

0.2

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

5

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

seekportbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/cf/fe/remote/ffads.cfm

Rule

Path

Disallow

/cf/fe/remote/ffads.cfm

Back to top

Other Records

Field	Value
sitemap	http://artetcuriosites.canalblog.com/rss.xml

Field

Value

sitemap

http://artetcuriosites.canalblog.com/rss.xml

Back to top

artetcuriosites.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

bingbot

Other Records

msnbot

Other Records

msnbot-media

Other Records

pinterestbot

Other Records

semrushbot

Other Records

ahrefsbot

seekportbot

*

Other Records

artetcuriosites.com
robots.txt