thefitcareerist.com
robots.txt

Robots Exclusion Standard data for thefitcareerist.com

Resource Scan

Scan Details

Site Domain thefitcareerist.com
Base Domain thefitcareerist.com
Scan Status Ok
Last Scan4/2/2025, 1:13:20 PM
Next Scan 4/9/2025, 1:13:20 PM

Last Scan

Scanned4/2/2025, 1:13:20 PM
URL https://thefitcareerist.com/robots.txt
Domain IPs 104.21.57.147, 172.67.146.164, 2606:4700:3035::6815:3993, 2606:4700:3035::ac43:92a4
Response IP 104.21.57.147
Found Yes
Hash 7d60377d22581332643acebb51413bb254195d466a46af873e29870ca41bc9c5
SimHash 3a051d0500b2

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-login.php
Disallow /xmlrpc.php

*

Rule Path
Disallow /*.doc$
Disallow /*.pdf$
Disallow /*.zip$

Other Records

Field Value
sitemap https://thefitcareerist.com/sitemap_index.xml