councilcan.org
robots.txt

Robots Exclusion Standard data for councilcan.org

Resource Scan

Scan Details

Site Domain	councilcan.org
Base Domain	councilcan.org
Scan Status	Ok
Last Scan	2024-11-15T21:43:23+00:00
Next Scan	2024-11-29T21:43:23+00:00

Last Scan

Scanned	2024-11-15T21:43:23+00:00
URL	https://councilcan.org/robots.txt
Domain IPs	104.17.59.185
Response IP	104.17.59.185
Found	Yes
Hash	c0b1c273ab994252f4be71016743424b1a4895a8d11a486e4cf9888938fd7d54
SimHash	2d459801e9db

Groups

*

Rule	Path
Disallow	/pick-institution
Disallow	/terms
Disallow	/privacy-policy
Disallow	/legal
Disallow	/backoffice
Disallow	/networks/*/recruiter/jobs

mj12bot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

ut-dorkbot

Rule	Path
Disallow	/

ut-dorkbot/1.0

Rule	Path
Disallow	/
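The groups above can be read as a small rule table. The following is a minimal sketch of how a crawler might evaluate them, assuming prefix matching with `*` as a wildcard and a simplified substring match on the user-agent token. The helper names (`GROUPS`, `pattern_to_regex`, `select_group`, `is_allowed`) and the sample agent "examplebot" are hypothetical and are not part of the scan data.

```python
import re

# Groups as listed in the scan above: user-agent token -> disallowed paths.
GROUPS = {
    "*": [
        "/pick-institution",
        "/terms",
        "/privacy-policy",
        "/legal",
        "/backoffice",
        "/networks/*/recruiter/jobs",
    ],
    "mj12bot": ["/"],
    "semrushbot": ["/"],
    "ut-dorkbot": ["/"],
    "ut-dorkbot/1.0": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def select_group(user_agent):
    """Pick the most specific group whose token appears in the user agent.

    Assumption: a case-insensitive substring test stands in for proper
    product-token matching. Falls back to the '*' group.
    """
    ua = user_agent.lower()
    candidates = [token for token in GROUPS if token != "*" and token in ua]
    if candidates:
        return GROUPS[max(candidates, key=len)]
    return GROUPS["*"]


def is_allowed(user_agent, path):
    """Return True if no Disallow pattern in the selected group matches path."""
    return not any(
        pattern_to_regex(p).match(path) for p in select_group(user_agent)
    )


if __name__ == "__main__":
    print(is_allowed("examplebot", "/about"))
    print(is_allowed("examplebot", "/networks/acme/recruiter/jobs"))
    print(is_allowed("Mozilla/5.0 (compatible; MJ12bot/v1.4.8)", "/about"))
```

Because every listed rule is a Disallow, the sketch does not need Allow handling or longest-match precedence; a file that mixes Allow and Disallow rules would.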

Other Records

Field	Value
sitemap	https://councilcan.org/sitemap.xml
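As an illustration, the groups and the sitemap record can be fed to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from the scan tables, not the original file, so its exact layout, casing, and ordering are assumptions (it would not be expected to reproduce the hash listed above). The agent name "examplebot" is hypothetical. To my knowledge the standard parser treats paths as plain prefixes and does not expand `*` inside a path, so the wildcard rule is left out of the checks.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan tables; the real file's formatting may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pick-institution
Disallow: /terms
Disallow: /privacy-policy
Disallow: /legal
Disallow: /backoffice
Disallow: /networks/*/recruiter/jobs

User-agent: mj12bot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: ut-dorkbot
Disallow: /

User-agent: ut-dorkbot/1.0
Disallow: /

Sitemap: https://councilcan.org/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

# A named group blocks everything; other agents fall through to '*'.
mj12_home = parser.can_fetch("mj12bot", "https://councilcan.org/")
other_about = parser.can_fetch("examplebot", "https://councilcan.org/about")
other_terms = parser.can_fetch("examplebot", "https://councilcan.org/terms")

if __name__ == "__main__":
    print(sitemaps, mj12_home, other_about, other_terms)
```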
