sanjoseinside.com
robots.txt
Robots Exclusion Standard data for sanjoseinside.com
Resource Scan
Scan Details
Site Domain | sanjoseinside.com |
Base Domain | sanjoseinside.com |
Scan Status | Ok |
Last Scan | 2024-09-26T10:07:07+00:00 |
Next Scan | 2024-10-03T10:07:07+00:00 |
Last Scan
Scanned | 2024-09-26T10:07:07+00:00 |
URL | https://sanjoseinside.com/robots.txt |
Domain IPs | 104.21.16.35, 172.67.166.18, 2606:4700:3032::ac43:a612, 2606:4700:3037::6815:1023 |
Response IP | 104.21.16.35 |
Found | Yes |
Hash | 786e88d3497c2ff735f6660f1b8170f686093a38c87aba20fbfcf6a07431fdf4 |
SimHash | 6b29dc60ab93 |
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | /event-sitemap.xml |
Warnings
- 1 invalid line.