sanjose.com
robots.txt
Robots Exclusion Standard data for sanjose.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | sanjose.com |
| Base Domain | sanjose.com |
| Scan Status | Ok |
| Last Scan | 2026-02-21T00:39:05+00:00 |
| Next Scan | 2026-02-28T00:39:05+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-21T00:39:05+00:00 |
| URL | https://sanjose.com/robots.txt |
| Redirect | https://www.sanjose.com/robots.txt |
| Redirect Domain | www.sanjose.com |
| Redirect Base | sanjose.com |
| Domain IPs | 104.21.15.151, 172.67.206.86, 2606:4700:3031::6815:f97, 2606:4700:3033::ac43:ce56 |
| Redirect IPs | 104.21.15.151, 172.67.206.86, 2606:4700:3031::6815:f97, 2606:4700:3033::ac43:ce56 |
| Response IP | 172.67.206.86 |
| Found | Yes |
| Hash | b4496e5ccfa8b2e3f680ef125138cc6b5ae45a7981723140504e51cb5ee98936 |
| SimHash | 291c5270cc91 |
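
The `Hash` value is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between two scans might look like the sketch below. The function names `body_digest` and `body_changed` and the sample bytes are hypothetical, not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scan's Hash field is SHA-256; the report does not say so.
    """
    return hashlib.sha256(body).hexdigest()


def body_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored by an earlier scan."""
    return body_digest(body) != previous_hash.lower()


# Hypothetical sample bytes, used only to exercise the functions.
sample = b"User-agent: *\nCrawl-delay: 1\n"
stored = body_digest(sample)
print(body_changed(sample, stored))         # same bytes as before
print(body_changed(sample + b"#", stored))  # one byte appended
```

An exact hash only detects that something changed. The separate `SimHash` field presumably serves to judge how similar two versions are, but its parameters are not given here.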
Groups
User-agent: *
No rules defined. All paths allowed.
Other Records
| Field | Value |
|---|---|
| crawl-delay | 1 |
Other Records
| Field | Value |
|---|---|
| sitemap | http://www.sanjose.com/sitemap_index.xml |
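
The records above can be checked with a standard parser. The following is a minimal sketch using Python's `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the scan fields and is an assumption, since the report does not show the raw file; it also assumes the crawl-delay belongs to the `*` group. The probed path `/any/path` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records; NOT the raw file served by the site.
# Assumption: crawl-delay sits inside the "*" group.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 1
Sitemap: http://www.sanjose.com/sitemap_index.xml
"""


def summarize(robots_txt: str, agent: str = "*") -> dict:
    """Parse robots.txt text and report what it permits for one user agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {
        # Hypothetical path; with no Disallow rules any path should be allowed.
        "allowed": parser.can_fetch(agent, "https://www.sanjose.com/any/path"),
        "crawl_delay": parser.crawl_delay(agent),
        # site_maps() requires Python 3.8 or later.
        "sitemaps": parser.site_maps(),
    }


summary = summarize(ROBOTS_TXT)
print(summary)
```

Under these assumptions a crawler may fetch any path but should wait one second between requests. Note that the sitemap URL is declared with `http://`, while the scan shows robots.txt itself being served from `https://www.sanjose.com`.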