docs.thejustinhq.com
robots.txt

Robots Exclusion Standard data for docs.thejustinhq.com

Resource Scan

Scan Details

Site Domain docs.thejustinhq.com
Base Domain thejustinhq.com
Scan Status Ok
Last Scan 2025-10-22T21:18:26+00:00
Next Scan 2025-11-21T21:18:26+00:00

Last Scan

Scanned 2025-10-22T21:18:26+00:00
URL https://docs.thejustinhq.com/robots.txt
Redirect https://docs.thejustinhq.com/kb/robots.txt
Domain IPs 104.18.40.47, 172.64.147.209, 2606:4700:4402::ac40:93d1, 2606:4700:4407::6812:282f
Response IP 104.18.40.47
Found Yes
Hash e9bcb2d4fb99f83d442cc15ec9874905771fe09301e8016eb217bad0658172d3
SimHash 5100da418713

Groups

*

Rule Path
Disallow /*?*q=*
Disallow /*?*ask=*
Allow /~gitbook/image?*
Allow /~gitbook/icon?*
Allow /favicon.ico
Allow /
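To see how the rules in this group interact, here is a minimal Python sketch of a matcher. It assumes the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), with `*` matching any run of characters and a trailing `$` anchoring the end of the URL. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data; real crawlers may differ in details.

```python
import re

# Rules transcribed from the "*" group listed above.
RULES = [
    ("disallow", "/*?*q=*"),
    ("disallow", "/*?*ask=*"),
    ("allow", "/~gitbook/image?*"),
    ("allow", "/~gitbook/icon?*"),
    ("allow", "/favicon.ico"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled prefix regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any sequence of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins, and Allow wins ties.
    A path that matches no rule is allowed by default.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ["/kb/page", "/kb/search?q=test", "/~gitbook/image?url=x&q=1"]:
        print(path, is_allowed(path))
```

Under these assumptions, ordinary pages are crawlable, URLs carrying a `q=` or `ask=` query parameter are not, and the `/~gitbook/image?*` and `/~gitbook/icon?*` rules override the query-string blocks because their patterns are longer. Note that `/*?*q=*` also matches any parameter whose name merely ends in `q`, such as `?faq=1`.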

Other Records

Field Value
sitemap https://docs.thejustinhq.com/kb/sitemap.xml