houmashouse.com
robots.txt

Robots Exclusion Standard data for houmashouse.com

Resource Scan

Scan Details

Site Domain houmashouse.com
Base Domain houmashouse.com
Scan Status Ok
Last Scan2024-05-17T21:24:16+00:00
Next Scan 2024-06-16T21:24:16+00:00

Last Scan

Scanned2024-05-17T21:24:16+00:00
URL https://houmashouse.com/robots.txt
Domain IPs 192.0.66.206, 2a04:fa87:fffd::c000:4288
Response IP 192.0.66.206
Found Yes
Hash 11cb3b789cbeabc3d05e1e89851090d22ca1875d7ae3cac42c9f07842ccf6fdc
SimHash 8c47ce72c489

Groups

*

Rule Path
Allow /edit/wp-includes/js/
Disallow /edit/

megaindex.ru/2.0
megaindex.ru
megaindex.ru
mauibot (crawler.feedback+wc@gmail.com)
seekport crawler
blexbot
baiduspider
barkrowler
gigabot
go-http-client
nuclei
riddler
seznambot
wikido
yandex
zoominfobot
magpie-crawler

Rule Path
Disallow /
Disallow /fhbr-console/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://houmashouse.com/sitemap.xml