pen-house.net
robots.txt

Robots Exclusion Standard data for pen-house.net

Resource Scan

Scan Details

Site Domain pen-house.net
Base Domain pen-house.net
Scan Status Ok
Last Scan2024-11-12T04:22:05+00:00
Next Scan 2024-11-26T04:22:05+00:00

Last Scan

Scanned2024-11-12T04:22:05+00:00
URL https://pen-house.net/robots.txt
Domain IPs 18.155.68.10, 18.155.68.124, 18.155.68.16, 18.155.68.24
Response IP 18.155.68.10
Found Yes
Hash 8c60dc88acd56d7e42a1f3c2874bb6f64f25b0fcacd40c5d76c509520b8646c8
SimHash 673ab044efab

Groups

*

Rule Path
Disallow /admin.html
Disallow /item_download.html
Disallow /item_download_real.html

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yetbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5