retreatguide.com
robots.txt

Robots Exclusion Standard data for retreatguide.com

Resource Scan

Scan Details

Site Domain retreatguide.com
Base Domain retreatguide.com
Scan Status Ok
Last Scan2024-11-16T19:59:11+00:00
Next Scan 2024-11-23T19:59:11+00:00

Last Scan

Scanned2024-11-16T19:59:11+00:00
URL https://retreatguide.com/robots.txt
Redirect http://www.retreatguide.com/robots.txt
Redirect Domain www.retreatguide.com
Redirect Base retreatguide.com
Domain IPs 66.147.238.103
Redirect IPs 66.147.238.103
Response IP 66.147.238.103
Found Yes
Hash 961fb5096200ffaa86aa84cc88ac7e83bd778d2b2b762f09de33de8755ee1d7d
SimHash 484ad692d133

Groups

facebookexternalhit

Rule Path
Disallow /api/
Allow /

googlebot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

yahoo-mmcrawler

Rule Path
Allow /

yahoo-slurp

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

bing preview bot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

*

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /