scouterecruit.net
robots.txt

Robots Exclusion Standard data for scouterecruit.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	scouterecruit.net
Base Domain	scouterecruit.net
Scan Status	Ok
Last Scan	2024-05-24T13:24:35+00:00
Next Scan	2024-06-23T13:24:35+00:00

Last Scan

Scanned	2024-05-24T13:24:35+00:00
URL	https://scouterecruit.net/robots.txt
Domain IPs	13.33.88.111, 13.33.88.2, 13.33.88.65, 13.33.88.77
Response IP	13.33.88.2
Found	Yes
Hash	68615296d8a418891663d7810f878f63d8d6a9b25de14e705bdec1645294a9db
SimHash	860d6d0d6640

Groups

*

Rule	Path
Disallow	/admin
Disallow	/charts/
Disallow	/images/
Disallow	/stylesheets/
Disallow	/javascript/
Disallow	/w3c/
Disallow	/404.html
Disallow	/422.html
Disallow	/500.html
Disallow	/jobs/ni/
Disallow	/Jobs/ni/
Disallow	/applications/
Disallow	/jobs/NSBC3-workforce-register

Rule

Path

Disallow

/admin

Disallow

/charts/

Disallow

/images/

Disallow

/stylesheets/

Disallow

/javascript/

Disallow

/w3c/

Disallow

/404.html

Disallow

/422.html

Disallow

/500.html

Disallow

/jobs/ni/

Disallow

/Jobs/ni/

Disallow

/applications/

Disallow

/jobs/NSBC3-workforce-register

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
Disallow: /

Back to top

scouterecruit.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

semrushbot

Comments

scouterecruit.net
robots.txt