ironman.com
robots.txt
Robots Exclusion Standard data for ironman.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ironman.com |
Base Domain | ironman.com |
Scan Status | Ok |
Last Scan | 2024-11-10T17:08:37+00:00 |
Next Scan | 2024-11-17T17:08:37+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-10T17:08:37+00:00 |
URL | https://ironman.com/robots.txt |
Domain IPs | 104.16.223.243 |
Response IP | 104.16.223.243 |
Found | Yes |
Hash | db8a3b91e366006254b8bb96155a7103c1ede2417162f74cac39f90eebe148d9 |
SimHash | 8800df40e4c3 |
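The report does not say how the Hash value is computed. Its length (64 hexadecimal digits) is consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body. The function name `body_digest` is hypothetical, and the scanner may normalize the body differently before hashing, so a mismatch against the value above would not by itself mean the file changed.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scan's Hash field is SHA-256 of the unmodified bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body for illustration only, not the real file contents.
    sample = b"User-agent: *\nDisallow: /assets\n"
    print(body_digest(sample))
```

Comparing digests from two scans is a cheap way to detect that the file changed at all; the SimHash field presumably serves to judge how similar two versions are, which an exact digest cannot do.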
Groups
*
Rule | Path |
---|---|
Disallow | /assets |
Disallow | /*/ical_instructions* |
Disallow | /documents |
Disallow | /requests |
Disallow | /*? |
*
Rule | Path |
---|---|
Disallow | /event/show_day |
Disallow | /event/*/*/*/* |
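Several of the paths above use the `*` wildcard. As a rough sketch of how such rules are commonly interpreted (prefix matching, with `*` standing for any run of characters and a trailing `$` anchoring the end, as described in RFC 9309), the Python below checks a URL path against the combined Disallow rules of both `*` groups. The names `pattern_to_regex`, `is_disallowed`, and `DISALLOW` are hypothetical. The sketch assumes the two groups are merged and, because the report lists no Allow rules, skips longest-match precedence entirely. Individual crawlers may behave differently.

```python
import re

# Disallow paths from both "*" groups in the scan, treated as one merged group.
DISALLOW = [
    "/assets",
    "/*/ical_instructions*",
    "/documents",
    "/requests",
    "/*?",
    "/event/show_day",
    "/event/*/*/*/*",
]


def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the
    end of the path. Everything else is matched literally.
    """
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_disallowed(url_path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(url_path) for p in DISALLOW)


if __name__ == "__main__":
    for path in ["/assets/logo.png", "/races", "/races?page=2",
                 "/event/a/b", "/event/a/b/c/d"]:
        print(path, is_disallowed(path))
```

Under these assumptions, `/*?` blocks any path that carries a query string, and `/event/*/*/*/*` only takes effect once a path has at least four segments below `/event/`.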
Warnings
- 2 invalid lines.