/.well-known/

Log In Sign Up

outsidethabox.org
robots.txt

Robots Exclusion Standard data for outsidethabox.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	outsidethabox.org
Base Domain	outsidethabox.org
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-04-07T15:32:03+00:00
Next Scan	2024-07-06T15:32:03+00:00

Last Successful Scan

Scanned	2023-11-17T15:28:22+00:00
URL	https://outsidethabox.org/robots.txt
Redirect	https://www.outsidethabox.org/robots.txt
Redirect Domain	www.outsidethabox.org
Redirect Base	outsidethabox.org
Domain IPs	54.183.102.22
Redirect IPs	18.176.133.53, 54.95.115.3
Response IP	18.181.31.166
Found	Yes
Hash	e16035666f90064c9352720aa3c44ca9ca5de0e53d304e6ec0659baed73b8c88
SimHash	aa8d6fad6450

Groups

semrushbot

Rule

Path

Disallow

/

blackwidow

Rule

Path

Disallow

/

Back to top

Other Records

Field

Value

sitemap

https://www.outsidethabox.org/sitemap.xml

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /

Back to top