sirfrancis.com
robots.txt
Robots Exclusion Standard data for sirfrancis.com
Resource Scan
Scan Details
Site Domain | sirfrancis.com |
Base Domain | sirfrancis.com |
Scan Status | Ok |
Last Scan | 2025-08-28T17:17:35+00:00 |
Next Scan | 2025-09-11T17:17:35+00:00 |
Last Scan
Scanned | 2025-08-28T17:17:35+00:00 |
URL | https://sirfrancis.com/robots.txt |
Domain IPs | 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001 |
Response IP | 104.21.96.1 |
Found | Yes |
Hash | 18581a07928e9411bd29deae7316c9b9b0fedc8dc258b13470e30fdb252c01e8 |
SimHash | 63476f732ba7 |
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Allow | / |
Disallow | /*%26amp%3Blt%3Biframe |
Disallow | /*?currency= |
Disallow | /*/p*?page=* |
Disallow | /*/page-*?page=* |
Disallow | /cart |
Disallow | */redirect |
Other Records
Field | Value |
---|---|
sitemap | https://sirfrancis.com/sitemap.xml |
Warnings
- 13 invalid lines.