joshcaratelli.com
robots.txt

Robots Exclusion Standard data for joshcaratelli.com

Resource Scan

Scan Details

Site Domain joshcaratelli.com
Base Domain joshcaratelli.com
Scan Status Ok
Last Scan 2025-10-08T06:52:44+00:00
Next Scan 2025-11-07T06:52:44+00:00

Last Scan

Scanned 2025-10-08T06:52:44+00:00
URL https://joshcaratelli.com/robots.txt
Redirect https://www.joshcaratelli.com/robots.txt
Redirect Domain www.joshcaratelli.com
Redirect Base joshcaratelli.com
Domain IPs 54.183.102.22
Redirect IPs 18.181.31.166, 54.248.227.74
Response IP 54.95.115.3
Found Yes
Hash a190e69271314696036676bfd8da9f7831f728487e71187cf2010af161945365
SimHash b28d6fad6450

Groups

semrushbot

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.joshcaratelli.com/sitemap.xml
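
The groups and sitemap record above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanner's own method: the ROBOTS_LINES list is a reconstruction from the parsed records in this report (the original file's exact wording, capitalization, and comment lines are not reproduced), and the allowed() helper is a hypothetical name used only for illustration. site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the parsed groups in this report,
# not a byte-for-byte copy of the live robots.txt file.
ROBOTS_LINES = [
    "User-agent: semrushbot",
    "Disallow: /",
    "",
    "User-agent: blackwidow",
    "Disallow: /",
    "",
    "Sitemap: https://www.joshcaratelli.com/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in memory; no network request is made


def allowed(agent: str, path: str = "/") -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the redirect domain?"""
    return parser.can_fetch(agent, "https://www.joshcaratelli.com" + path)


if __name__ == "__main__":
    for agent in ("semrushbot", "blackwidow", "Googlebot"):
        print(agent, allowed(agent))
    print(parser.site_maps())
```

Under these assumptions the two named crawlers are refused for every path, while an agent with no matching group (Googlebot here, chosen only as an example) is allowed, because the file defines no catch-all "User-agent: *" group.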

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /