ironman.com
robots.txt

Robots Exclusion Standard data for ironman.com

Resource Scan

Scan Details

Site Domain ironman.com
Base Domain ironman.com
Scan Status Ok
Last Scan2024-11-10T17:08:37+00:00
Next Scan 2024-11-17T17:08:37+00:00

Last Scan

Scanned2024-11-10T17:08:37+00:00
URL https://ironman.com/robots.txt
Domain IPs 104.16.223.243
Response IP 104.16.223.243
Found Yes
Hash db8a3b91e366006254b8bb96155a7103c1ede2417162f74cac39f90eebe148d9
SimHash 8800df40e4c3

Groups

*

Rule Path
Disallow /assets
Disallow /*/ical_instructions*
Disallow /documents
Disallow /requests
Disallow /*?

semrushbot-sa

Rule Path
Disallow

screaming frog seo spider

Rule Path
Disallow

*

Rule Path
Disallow /event/show_day
Disallow /event/*/*/*/*

Warnings

  • 2 invalid lines.