blog.thebestjohn.com
robots.txt

Robots Exclusion Standard data for blog.thebestjohn.com

Resource Scan

Scan Details

Site Domain blog.thebestjohn.com
Base Domain thebestjohn.com
Scan Status Ok
Last Scan 2025-10-16T01:29:14+00:00
Next Scan 2025-11-15T01:29:14+00:00

Last Scan

Scanned 2025-10-16T01:29:14+00:00
URL https://blog.thebestjohn.com/robots.txt
Domain IPs 104.21.39.137, 172.67.170.204, 2606:4700:3030::ac43:aacc, 2606:4700:3037::6815:2789
Response IP 104.21.39.137
Found Yes
Hash c298ba1a02ada7901d09f8745b9028b4a1c54402dfa8ebf7d61c8dae0b0e7467
SimHash 651d8945c493
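
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names (sha256_hex, matches_recorded_hash) are hypothetical, and a mismatch could mean either that the file changed or that the assumption is wrong.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED = "c298ba1a02ada7901d09f8745b9028b4a1c54402dfa8ebf7d61c8dae0b0e7467"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED) -> bool:
    """Compare a fetched robots.txt body against the recorded hash.

    Assumption: the scanner hashed the raw, undecoded response body
    with SHA-256. The report itself does not confirm this.
    """
    return sha256_hex(body) == recorded.strip().lower()
```

To use it, fetch https://blog.thebestjohn.com/robots.txt with any HTTP client and pass the undecoded response bytes to matches_recorded_hash.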

Groups

User-agent *

Rule Path
Allow /posts
Allow /about
Allow /glossary

Other Records

Field Value
sitemap /sitemap.xml
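
The records above can be written back out as a robots.txt file and checked with Python's standard urllib.robotparser module. This is a minimal sketch: the file text is reconstructed from the tables in this report, so the exact casing, spacing, and any comments in the live file are assumptions, and /drafts is a hypothetical path used only to show how an unlisted path is treated.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables in this report.
# The live file's exact bytes are an assumption.
ROBOTS_TXT = """\
User-agent: *
Allow: /posts
Allow: /about
Allow: /glossary

Sitemap: /sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://blog.thebestjohn.com"

# /drafts is hypothetical; it does not appear in the scan data.
for path in ("/posts", "/about", "/glossary", "/drafts"):
    print(path, parser.can_fetch("*", base + path))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Because the group contains only Allow rules and no Disallow rule, the parser reports every path as fetchable, including ones that are not listed. The Allow lines therefore do not restrict crawling on their own.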