thewatersheds.com
robots.txt

Robots Exclusion Standard data for thewatersheds.com

Resource Scan

Scan Details

Site Domain thewatersheds.com
Base Domain thewatersheds.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-11-18T16:06:24+00:00
Next Scan 2026-01-17T16:06:24+00:00

Last Successful Scan

Scanned 2025-09-20T07:24:28+00:00
URL https://thewatersheds.com/robots.txt
Domain IPs 104.21.93.174, 172.67.213.70, 2606:4700:3030::ac43:d546, 2606:4700:3031::6815:5dae
Response IP 172.67.213.70
Found Yes
Hash a01e793c8fc54b595b73495b13c716aea3ef52d38fdc2cc69b8fa65ea3c21241
SimHash 4d1042f257f5

Groups

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

msnbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

twitterbot

Rule Path
Allow /

facebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

adidxbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

bingpreview

Rule Path
Allow /

panscient.com

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

yandex

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /
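
The groups above amount to an allowlist: named search and social crawlers get Allow /, while several named bots and the catch-all * group get Disallow /. As a rough illustration of how a crawler might evaluate such rules, the sketch below feeds a condensed, hypothetical reconstruction of the file to Python's standard urllib.robotparser. The ROBOTS_TXT text, the build_parser helper, and the SomeOtherBot agent name are assumptions made for the example; the report does not show the original file's exact text or ordering.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical, condensed reconstruction of the rules listed above.
# Only a few of the listed groups are reproduced; the original file's
# exact text, ordering, and comments are not known from this report.
ROBOTS_TXT = """\
User-agent: googlebot
Allow: /

User-agent: bingbot
Allow: /

User-agent: yandex
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: *
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network fetch)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
URL = "https://thewatersheds.com/"

# SomeOtherBot is a made-up agent with no group of its own,
# so it falls through to the catch-all "*" group.
for agent in ("Googlebot", "bingbot", "Yandex", "AhrefsBot", "SomeOtherBot"):
    verdict = "allowed" if parser.can_fetch(agent, URL) else "blocked"
    print(f"{agent}: {verdict}")
```

Note that urllib.robotparser matches user-agent tokens by case-insensitive substring and applies the first matching group, which can differ from the group-selection behavior of individual crawlers, so the result is indicative rather than authoritative.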