myschoolgist.com
robots.txt

Robots Exclusion Standard data for myschoolgist.com

Resource Scan

Scan Details

Site Domain myschoolgist.com
Base Domain myschoolgist.com
Scan Status Ok
Last Scan2024-06-27T03:39:45+00:00
Next Scan 2024-07-04T03:39:45+00:00

Last Scan

Scanned2024-06-27T03:39:45+00:00
URL https://myschoolgist.com/robots.txt
Redirect https://www.myschoolgist.com/robots.txt
Redirect Domain www.myschoolgist.com
Redirect Base myschoolgist.com
Domain IPs 104.26.10.222, 104.26.11.222, 172.67.70.100, 2606:4700:20::681a:ade, 2606:4700:20::681a:bde, 2606:4700:20::ac43:4664
Redirect IPs 104.26.10.222, 104.26.11.222, 172.67.70.100, 2606:4700:20::681a:ade, 2606:4700:20::681a:bde, 2606:4700:20::ac43:4664
Response IP 172.67.70.100
Found Yes
Hash fd5c04548035858aefe9ac822e29a3b064d88b12e5103fc5d2c0778a22c9c0c2
SimHash cea6dd75fca9

Groups

*

Rule Path
Disallow /wp-json/
Disallow /?s=*
Disallow /search/*
Disallow /cdn-cgi/bm/cv/
Disallow /cdn-cgi/challenge-platform/

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
domain re-animator bot
obot
turnitinbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.myschoolgist.com/sitemap_index.xml

Comments

  • Hi robots! Glad you came!
  • Humans, please refer to /humans.txt