
Robots Exclusion Standard data for penciljack.com

Resource Scan

Scan Details

Site Domain penciljack.com
Base Domain penciljack.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-10-23T15:49:58+00:00
Next Scan 2026-01-21T15:49:58+00:00

Last Successful Scan

Scanned 2024-04-01T05:08:35+00:00
URL https://www.penciljack.com/robots.txt
Domain IPs 104.18.207.254, 104.18.208.24, 104.18.223.254, 104.18.224.20, 104.18.224.24, 2606:4700::6812:cffe, 2606:4700::6812:d018, 2606:4700::6812:dffe, 2606:4700::6812:e014, 2606:4700::6812:e018
Response IP 104.18.207.254
Found Yes
Hash e9acee9d5c09614d9ec5ea203f418a9c198f1fab684aaf10d7f27103637ffd78
SimHash 302cdc70a013

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /admincp/
Disallow /modcp/
Disallow /attachments/
Disallow /images/
Disallow /members/
Disallow /search/
Disallow /new-content/
Disallow /auth/
Disallow /register
Disallow /uploader/url

baiduspider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap /core/xmlsitemap.php
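
The groups listed above can be checked programmatically. What follows is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT string is a hypothetical reconstruction from the scan data, not the original file, whose exact layout, casing, and comments are not shown here. The two `*` groups are merged into one by hand, which is how RFC 9309 tells crawlers to treat repeated groups for the same user agent. ExampleBot is a made-up user agent used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the scanned rules (assumption: the two "*"
# groups are merged by hand, because Python's parser keeps only the first
# "*" group it encounters).
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 1
Disallow: /admincp/
Disallow: /modcp/
Disallow: /attachments/
Disallow: /images/
Disallow: /members/
Disallow: /search/
Disallow: /new-content/
Disallow: /auth/
Disallow: /register
Disallow: /uploader/url

User-agent: baiduspider
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: ccbot
Disallow: /

Sitemap: /core/xmlsitemap.php
"""


def build_parser() -> RobotFileParser:
    """Parse the reconstructed rules without touching the network."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    # "ExampleBot" is a made-up agent that falls under the "*" group.
    for agent, path in [
        ("ExampleBot", "/threads/1"),
        ("ExampleBot", "/members/1"),
        ("gptbot", "/threads/1"),
    ]:
        verdict = "allowed" if parser.can_fetch(agent, path) else "disallowed"
        print(f"{agent} {path}: {verdict}")
    print("crawl-delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
```

Matching in this module is by simple path prefix, so `Disallow: /register` also covers any longer path that begins with `/register`. The four named agents are blocked from every path, while all other agents are limited only by the merged `*` group.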