ironman.com
robots.txt
Robots Exclusion Standard data for ironman.com
Resource Scan
Scan Details
Site Domain | ironman.com |
Base Domain | ironman.com |
Scan Status | Ok |
Last Scan | 2024-05-26T06:46:38+00:00 |
Next Scan | 2024-06-02T06:46:38+00:00 |
Last Scan
Scanned | 2024-05-26T06:46:38+00:00 |
URL | https://ironman.com/robots.txt |
Domain IPs | 104.16.223.243 |
Response IP | 104.16.223.243 |
Found | Yes |
Hash | 5ad709d2ea943546782b9943c2b3deb798a53210fcf14f9f3ff7caaf2b8dbab6 |
SimHash | a80adbc0e8e3 |
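
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch, assuming the hash is SHA-256 over the raw bytes of the fetched robots.txt body, a fresh copy could be compared against the recorded value as follows. The helper names `sha256_hex` and `matches_recorded_hash` are hypothetical and used only for illustration.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "5ad709d2ea943546782b9943c2b3deb798a53210fcf14f9f3ff7caaf2b8dbab6"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes) -> bool:
    """Compare a fetched body against the recorded hash.

    Assumption: the report hashes the raw response body with SHA-256.
    If the scanner normalises the content first, this will not match.
    """
    return sha256_hex(body) == RECORDED_HASH
```

A mismatch would not by itself prove that the file changed, since the hashing scheme here is an assumption rather than something the report states.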
Groups
*
Rule | Path |
---|---|
Disallow | /assets |
Disallow | /*/ical_instructions* |
Disallow | /documents |
Disallow | /requests |
*
Rule | Path |
---|---|
Disallow | /event/show_day |
Disallow | /event/*/*/*/* |
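
The rules in both `*` groups use prefix matching, with `*` standing for any run of characters. The sketch below shows how a path could be checked against them. It assumes the two `*` groups are merged into one rule set, as RFC 9309 describes for multiple groups naming the same user agent, and it handles only the `*` and trailing `$` wildcards. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library.

```python
import re

# Disallow paths copied from the two "*" groups listed above,
# merged into a single rule set (an assumption, following RFC 9309).
DISALLOW = [
    "/assets",
    "/*/ical_instructions*",
    "/documents",
    "/requests",
    "/event/show_day",
    "/event/*/*/*/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Each "*" matches any run of characters; a trailing "$" anchors the
    end of the path. Without "$" the rule is a prefix match.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    # re.Pattern.match anchors at the beginning of the string,
    # which gives the prefix-match behaviour.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/assets/logo.png", "/event/a/b", "/event/a/b/c/d"]:
        print(candidate, is_disallowed(candidate))
```

Because the file lists no Allow rules, the sketch skips the longest-match tie-breaking that a full parser would need. Note that `/event/*/*/*/*` only matches paths with at least four segments after `/event/`, so shallower event URLs stay crawlable.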
Warnings
- The file contains 2 invalid lines.