outsidethabox.org
robots.txt

Robots Exclusion Standard data for outsidethabox.org

Resource Scan

Scan Details

Site Domain outsidethabox.org
Base Domain outsidethabox.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-04-07T15:32:03+00:00
Next Scan 2024-07-06T15:32:03+00:00
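
The gap between the two timestamps above can be checked with a few lines of Python. This is only an arithmetic sketch over the values shown in this report; whether the scanner actually schedules on a fixed interval is not stated here and is an assumption.

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" fields of this report.
last_scan = datetime.fromisoformat("2024-04-07T15:32:03+00:00")
next_scan = datetime.fromisoformat("2024-07-06T15:32:03+00:00")

# Difference between the two timezone-aware timestamps.
interval = next_scan - last_scan
print(interval.days)  # 90
```

For this record the two scans are exactly 90 days apart.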

Last Successful Scan

Scanned 2023-11-17T15:28:22+00:00
URL https://outsidethabox.org/robots.txt
Redirect https://www.outsidethabox.org/robots.txt
Redirect Domain www.outsidethabox.org
Redirect Base outsidethabox.org
Domain IPs 54.183.102.22
Redirect IPs 18.176.133.53, 54.95.115.3
Response IP 18.181.31.166
Found Yes
Hash e16035666f90064c9352720aa3c44ca9ca5de0e53d304e6ec0659baed73b8c88
SimHash aa8d6fad6450

Groups

semrushbot

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.outsidethabox.org/sitemap.xml
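
As a rough illustration of what the groups and sitemap record above mean for a crawler, the sketch below feeds an equivalent robots.txt to Python's standard `urllib.robotparser`. The file text is reconstructed from the parsed fields in this report, not copied from the site, so its exact casing, ordering, and comments are assumptions. The `examplebot` agent and the `build_parser` helper are hypothetical names used only for the demonstration. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of this report.
# The original file's exact text is not shown here, so this is an assumption.
ROBOTS_TXT = """\
User-agent: semrushbot
Disallow: /

User-agent: blackwidow
Disallow: /

Sitemap: https://www.outsidethabox.org/sitemap.xml
"""


def build_parser(text):
    """Parse robots.txt text offline, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
url = "https://www.outsidethabox.org/"

# "examplebot" is a hypothetical agent that matches neither group.
for agent in ("semrushbot", "blackwidow", "examplebot"):
    print(agent, parser.can_fetch(agent, url))

print(parser.site_maps())
```

Under these assumptions the two named agents are refused every path, while an agent that matches neither group is allowed, because the file has no active `User-agent: *` group.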

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /