stranahanhouse.org
robots.txt

Robots Exclusion Standard data for stranahanhouse.org

Resource Scan

Scan Details

Site Domain stranahanhouse.org
Base Domain stranahanhouse.org
Scan Status Ok
Last Scan2024-09-14T22:46:21+00:00
Next Scan 2024-10-14T22:46:21+00:00

Last Scan

Scanned2024-09-14T22:46:21+00:00
URL https://stranahanhouse.org/robots.txt
Domain IPs 192.0.66.239, 2a04:fa87:fffd::c000:427d
Response IP 192.0.66.239
Found Yes
Hash fca77525f8605e2293cd72f4392e962829a75bddac270f2493b47935b11ed168
SimHash cc27ce72c789

Groups

*

Rule Path
Allow /edit/wp-includes/js/
Disallow /edit/

megaindex.ru/2.0
megaindex.ru
megaindex.ru
mauibot (crawler.feedback+wc@gmail.com)
seekport crawler
blexbot
baiduspider
barkrowler
gigabot
go-http-client
nuclei
riddler
seznambot
wikido
yandex
zoominfobot
magpie-crawler

Rule Path
Disallow /
Disallow /fhbr-console/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://stranahanhouse.org/sitemap.xml