homeowatch.org
robots.txt

Robots Exclusion Standard data for homeowatch.org

Resource Scan

Scan Details

Site Domain homeowatch.org
Base Domain homeowatch.org
Scan Status Ok
Last Scan 2025-05-03T17:17:57+00:00
Next Scan 2025-06-02T17:17:57+00:00

Last Scan

Scanned 2025-05-03T17:17:57+00:00
URL https://homeowatch.org/robots.txt
Redirect https://quackwatch.org/robots.txt
Redirect Domain quackwatch.org
Redirect Base quackwatch.org
Domain IPs 67.227.143.240
Redirect IPs 104.21.93.203, 172.67.214.94, 2606:4700:3035::ac43:d65e, 2606:4700:3037::6815:5dcb
Response IP 104.21.93.203
Found Yes
Hash 1da869a0fb9d1a4015897e1da08566b7c2df83d484eb9d706d82aa661e8e4d14
SimHash 6ab65a32c2a3

Groups

mauibot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Allow /

applebot

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

facebot

Rule Path
Allow /

googlebot

Rule Path
Allow /
Allow /wp-content/uploads/
Disallow /activity/
Disallow /?acpage=
Disallow /s/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /*/feed/
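
The googlebot group mixes a blanket "Allow /" with narrower Disallow paths, so the outcome for a given URL depends on which rule takes precedence. The sketch below assumes the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names GOOGLEBOT_RULES, pattern_to_regex and is_allowed are hypothetical helpers for illustration, not part of the scan data, and real crawlers may differ in details.

```python
import re

# Rules copied from the googlebot group in the listing above.
GOOGLEBOT_RULES = [
    ("allow", "/"),
    ("allow", "/wp-content/uploads/"),
    ("disallow", "/activity/"),
    ("disallow", "/?acpage="),
    ("disallow", "/s/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/*/feed/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a path prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules` (longest match wins)."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Prefer the longer pattern; on equal length prefer Allow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


for path in ["/", "/wp-admin/", "/wp-content/uploads/x.png", "/some-post/feed/"]:
    print(path, is_allowed(path, GOOGLEBOT_RULES))
```

Under this assumption the blanket "Allow /" only decides paths that no longer rule matches, which is why it can sit alongside the Disallow lines without cancelling them.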

msnbot

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Allow /

teoma

Rule Path
Disallow /

twitterbot

Rule Path
Allow /

yandex

Rule Path
Disallow /

yeti

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Allow /

*

Rule Path
Disallow /

*

No rules defined. All paths allowed.
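
The listing repeats some agents (facebookexternalhit appears twice, and so does "*"), so a reader may wonder which group a given crawler ends up with. A minimal sketch follows, assuming the RFC 9309 behaviour: the crawler's product token is compared case-insensitively, groups naming the same agent are merged, and "*" is used only when no named group matches. GROUPS holds only a subset of the listing, and GROUPS and rules_for are hypothetical names chosen for illustration.

```python
# A subset of the groups listed above, in listing order.
GROUPS = [
    ("mauibot", [("disallow", "/")]),
    ("facebookexternalhit", [("allow", "/")]),
    ("bingbot", [("disallow", "/")]),
    ("facebookexternalhit", [("allow", "/")]),
    ("*", [("disallow", "/")]),
    ("*", []),  # the second "*" group, which defines no rules
]


def rules_for(product_token, groups):
    """Return the merged rules that apply to a crawler's product token."""
    token = product_token.lower()
    if any(agent == token for agent, _ in groups):
        # Merge every group that names this agent explicitly.
        return [rule for agent, rules in groups if agent == token for rule in rules]
    # Otherwise fall back to the merged catch-all groups.
    return [rule for agent, rules in groups if agent == "*" for rule in rules]


print(rules_for("Bingbot", GROUPS))
print(rules_for("SomeOtherBot", GROUPS))
```

If the merge assumption holds, the empty second "*" group adds nothing, and a crawler without a named group is left with the "Disallow /" from the first.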

Other Records

Field Value
crawl-delay 5
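
The crawl-delay field is not part of RFC 9309, and crawlers that honour it commonly read the value as a minimum number of seconds between requests. Some crawlers, Googlebot among them, ignore it. A minimal sketch of how a polite client might apply the value 5, under that assumption; CRAWL_DELAY_SECONDS and seconds_to_wait are hypothetical names used only for illustration.

```python
import time

CRAWL_DELAY_SECONDS = 5  # value of the crawl-delay record above


def seconds_to_wait(last_fetch, now, delay=CRAWL_DELAY_SECONDS):
    """Return how long to pause before the next request (never negative)."""
    return max(0.0, delay - (now - last_fetch))


# Example: the previous request finished 2 seconds ago.
now = time.monotonic()
print(seconds_to_wait(now - 2.0, now))
```

A caller would pass the result to time.sleep before issuing the next request to the same host.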