dcclothesline.com
robots.txt

Robots Exclusion Standard data for dcclothesline.com

Resource Scan

Scan Details

Site Domain dcclothesline.com
Base Domain dcclothesline.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-09-21T07:54:07+00:00
Next Scan 2024-12-20T07:54:07+00:00

Last Successful Scan

Scanned 2024-02-17T03:44:12+00:00
URL https://dcclothesline.com/robots.txt
Domain IPs 104.21.69.197, 172.67.212.203, 2606:4700:3032::ac43:d4cb, 2606:4700:3037::6815:45c5
Response IP 172.67.212.203
Found Yes
Hash 78f0851b2c26b24d35b5dd5795bb41e635fcd88e051ecdaa13ea9c8ee61505c9
SimHash 6e025b85c0b7

Groups

*

Rule Path
Disallow /?blackhole

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

*

Rule Path
Disallow /wp-content/uploads/*
Disallow /?s=
Disallow /search/
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 30

twitterbot

Rule Path
Allow *

facebookexternalhit

Rule Path
Allow *

facebot

Rule Path
Allow *

baiduspider
baiduspider-image
baiduspider-video
baiduspider-news
baiduspider-favo
baiduspider-ads
baiduspider-cpro
genieo
hoaxybot
laserlikebot
semrushbot
seoscanners.net
seznambot
spbot
storygizebot
yandex
yandexbot
yandeximages
yandexmobilebot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dcclothesline.com/sitemap_index.xml
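
As a rough illustration of how groups like those listed above are evaluated, the following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hand-reconstructed subset of the recorded groups (the first `*` group, `bingbot`, `googlebot-image`, and the sitemap record), not the original file, and `examplebot` is a hypothetical user agent chosen for the example. Only one `*` group is included because this parser keeps the first default group it encounters.

```python
from urllib.robotparser import RobotFileParser

# Hand-reconstructed subset of the groups in the report above.
# This is an assumption for illustration, not the site's actual file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /?blackhole

User-agent: bingbot
Crawl-delay: 120

User-agent: googlebot-image
Disallow: /

Sitemap: https://www.dcclothesline.com/sitemap_index.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network fetch is needed.
parser.parse(ROBOTS_TXT.splitlines())

# googlebot-image has "Disallow: /", so every path is blocked for it.
image_allowed = parser.can_fetch("googlebot-image", "https://dcclothesline.com/some-page/")

# bingbot has no path rules, only a crawl delay.
bing_allowed = parser.can_fetch("bingbot", "https://dcclothesline.com/")
bing_delay = parser.crawl_delay("bingbot")

# "examplebot" (hypothetical) matches no named group and falls back to "*".
generic_allowed = parser.can_fetch("examplebot", "https://dcclothesline.com/")

# Sitemap records are global rather than tied to a group (Python 3.8+).
sitemaps = parser.site_maps()

print(image_allowed, bing_allowed, bing_delay, generic_allowed, sitemaps)
```

Parsing from a string rather than calling `read()` on the live URL is a deliberate choice here, since the most recent scan recorded above failed to connect to the server.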