javhouse.org
robots.txt

Robots Exclusion Standard data for javhouse.org

Resource Scan

Scan Details

Site Domain javhouse.org
Base Domain javhouse.org
Scan Status Ok
Last Scan 2025-09-10T15:50:39+00:00
Next Scan 2025-10-10T15:50:39+00:00

Last Scan

Scanned 2025-09-10T15:50:39+00:00
URL https://javhouse.org/robots.txt
Domain IPs 104.21.38.116, 172.67.222.147, 2606:4700:3034::ac43:de93, 2606:4700:3035::6815:2674
Response IP 104.21.38.116
Found Yes
Hash ad7fb0cf50ee4e3cae06761080a654b58d0bf98235c28a902aeeaa1ed904463d
SimHash 7d0db4706033

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /engine/download.php
Disallow /user/
Disallow /newposts/
Disallow */page/*
Disallow /page/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo
Disallow /print
Disallow /print/*
Disallow /f/
Disallow /f/*
Disallow /sort/
Disallow /sort/star/*/page/*
Disallow /sort/director/*/page/*
Disallow /sort/studio/*/page/*
Allow /sort/star/*
Allow /sort/director/*
Allow /sort/studio/*
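
The Allow and Disallow rules above overlap (for example, /sort/ is disallowed while /sort/star/* is allowed), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers introduced for illustration; they are not part of the scan or of any crawler. The sketch compares paths literally and does not normalize percent-encoding, so the %3D patterns only match paths that contain %3D verbatim.

```python
import re

# Rules copied from the "*" group above, as (kind, path pattern) pairs.
RULES = [
    ("disallow", "/engine/go.php"),
    ("disallow", "/engine/download.php"),
    ("disallow", "/user/"),
    ("disallow", "/newposts/"),
    ("disallow", "*/page/*"),
    ("disallow", "/page/"),
    ("disallow", "/statistics.html"),
    ("disallow", "/*subaction%3Duserinfo"),
    ("disallow", "/*subaction%3Dnewposts"),
    ("disallow", "/*do%3Dlastcomments"),
    ("disallow", "/*do%3Dfeedback"),
    ("disallow", "/*do%3Dregister"),
    ("disallow", "/*do%3Dlostpassword"),
    ("disallow", "/*do%3Daddnews"),
    ("disallow", "/*do%3Dstats"),
    ("disallow", "/*do%3Dpm"),
    ("disallow", "/*do%3Dsearch"),
    ("disallow", "/*do%3Ddownload"),
    ("disallow", "/*do%3Dgo"),
    ("disallow", "/print"),
    ("disallow", "/print/*"),
    ("disallow", "/f/"),
    ("disallow", "/f/*"),
    ("disallow", "/sort/"),
    ("disallow", "/sort/star/*/page/*"),
    ("disallow", "/sort/director/*/page/*"),
    ("disallow", "/sort/studio/*/page/*"),
    ("allow", "/sort/star/*"),
    ("allow", "/sort/director/*"),
    ("allow", "/sort/studio/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # A longer pattern is more specific; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, is_allowed("/sort/star/some-name") returns True because the Allow pattern /sort/star/* is longer than the Disallow pattern /sort/, while is_allowed("/sort/star/some-name/page/2") returns False because /sort/star/*/page/* is longer still. Crawlers that resolve conflicts differently (for example, first match in file order) may reach other results.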

Other Records

Field Value
sitemap https://javhouse.org/sitemap.xml

Warnings

  • `host` is not a known field.