sjcctimes.com
robots.txt

Robots Exclusion Standard data for sjcctimes.com

Resource Scan

Scan Details

Site Domain sjcctimes.com
Base Domain sjcctimes.com
Scan Status Ok
Last Scan 5/6/2025, 1:34:19 PM
Next Scan 6/5/2025, 1:34:19 PM

Last Scan

Scanned 5/6/2025, 1:34:19 PM
URL https://sjcctimes.com/robots.txt
Domain IPs 104.21.78.56, 172.67.217.52, 2606:4700:3034::6815:4e38, 2606:4700:3034::ac43:d934
Response IP 172.67.217.52
Found Yes
Hash e0ca83fc35e470556e881511719dbbe3dc06ffe998ed008ffc89251d789c4e9a
SimHash e95cc66ac713

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /?s=
Disallow /*?*

dotbot

Rule Path
Disallow /
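The groups above can be read as follows: every crawler is kept out of /cgi-bin/, search URLs, and any URL containing a query string, while dotbot is kept out of the whole site. A minimal sketch of how a crawler might evaluate these rules is shown below. The GROUPS dictionary and the function names rule_to_regex and is_disallowed are hypothetical, introduced only for illustration. The wildcard handling assumes the common convention that "*" matches any run of characters and a trailing "$" anchors the end of the path. User-agent matching is simplified to an exact, case-insensitive name lookup.

```python
import re

# Rules copied from the two groups listed in this report.
# The dictionary layout is an assumption made for this sketch.
GROUPS = {
    "*": ["/cgi-bin/", "/?s=", "/*?*"],
    "dotbot": ["/"],
}


def rule_to_regex(path):
    """Turn a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(user_agent, url_path):
    """Return True if url_path is blocked for user_agent.

    Falls back to the '*' group when no group names the agent.
    These groups contain only Disallow rules, so any match means blocked.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, as robots.txt rules do.
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("dotbot", "/news/article"),
        ("Googlebot", "/news/article"),
        ("Googlebot", "/news?page=2"),
        ("Googlebot", "/cgi-bin/test"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Because the "*" group includes /*?*, the narrower /?s= rule is redundant under this interpretation: any path with a query string is already covered.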