trentstudents.org
robots.txt

Robots Exclusion Standard data for trentstudents.org

Resource Scan

Scan Details

Site Domain trentstudents.org
Base Domain trentstudents.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-03-23T22:19:22+00:00
Next Scan 2024-06-21T22:19:22+00:00

Last Successful Scan

Scanned2023-02-05T18:23:55+00:00
URL https://trentstudents.org/robots.txt
Redirect https://www.trentstudents.org/robots.txt
Redirect Domain www.trentstudents.org
Redirect Base trentstudents.org
Domain IPs 54.195.242.143
Redirect IPs 18.155.68.104, 18.155.68.112, 18.155.68.127, 18.155.68.3, 2600:9000:23d2:1200:4:f546:7880:93a1, 2600:9000:23d2:2000:4:f546:7880:93a1, 2600:9000:23d2:2400:4:f546:7880:93a1, 2600:9000:23d2:2c00:4:f546:7880:93a1, 2600:9000:23d2:3e00:4:f546:7880:93a1, 2600:9000:23d2:5c00:4:f546:7880:93a1, 2600:9000:23d2:fa00:4:f546:7880:93a1, 2600:9000:23d2:fe00:4:f546:7880:93a1
Response IP 18.155.68.127
Found Yes
Hash 03d5dcf5887db9fcf28baa5a126a2959b4dcc9d0c09a18bc6c6c0a8961289314
SimHash b2850d8d6370

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /