studyabroad101.com
robots.txt

Robots Exclusion Standard data for studyabroad101.com

Resource Scan

Scan Details

Site Domain studyabroad101.com
Base Domain studyabroad101.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-04-05T12:13:49+00:00
Next Scan 2025-07-04T12:13:49+00:00

Last Successful Scan

Scanned 2023-08-20T01:58:25+00:00
URL https://studyabroad101.com/robots.txt
Redirect https://www.studyabroad101.com/robots.txt
Redirect Domain www.studyabroad101.com
Redirect Base studyabroad101.com
Domain IPs 172.66.40.95, 172.66.43.161, 2606:4700:3108::ac42:285f, 2606:4700:3108::ac42:2ba1
Redirect IPs 172.66.40.95, 172.66.43.161, 2606:4700:3108::ac42:285f, 2606:4700:3108::ac42:2ba1
Response IP 172.66.40.95
Found Yes
Hash 40f0a2e28b0e609f3230e200fe395b3ab88cf4b32ac0e6ed2422c6679c3632e2
SimHash 24a70d286d10

Groups

*

Rule Path
Disallow /admin
Disallow /secret_key
Disallow /users/
Disallow /advisor/
Disallow /provider_admin/
Disallow /resque/
Disallow /review-your-program
Disallow *fb-register*
Disallow /pictures/
Disallow /node/
Disallow /favorite_nodes/
Disallow /scanned/
Disallow /reviews/scanned/
Disallow /privatemsg/
Disallow /country/
Disallow /city/
Disallow /submit-media-likes
Disallow /search
Disallow /abroad101/search/
Disallow /programs/lookup
Disallow /study_subjects/
Disallow /super_admin_login
Disallow /begin_review/
Disallow /image_show*
Disallow /files/
Disallow /taxonomy/
Disallow */ratings
Disallow /lead_followups/
Disallow /cdn-cgi/
Disallow /press
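
Several of the rules above use the `*` wildcard (for example `*fb-register*` and `*/ratings`), which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a URL path against this group, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are illustrative and are not part of the scanned data or of any library.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/admin", "/secret_key", "/users/", "/advisor/", "/provider_admin/",
    "/resque/", "/review-your-program", "*fb-register*", "/pictures/",
    "/node/", "/favorite_nodes/", "/scanned/", "/reviews/scanned/",
    "/privatemsg/", "/country/", "/city/", "/submit-media-likes",
    "/search", "/abroad101/search/", "/programs/lookup", "/study_subjects/",
    "/super_admin_login", "/begin_review/", "/image_show*", "/files/",
    "/taxonomy/", "*/ratings", "/lead_followups/", "/cdn-cgi/", "/press",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    Assumption: "*" matches any sequence of characters and a trailing
    "$" anchors the match at the end of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any rule matches the path from its first character."""
    # re.match anchors at the start of the path, giving prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Because this group contains only Disallow rules, the sketch skips the longest-match precedence that would be needed if Allow rules were also present. Calling `is_disallowed("/programs/example/ratings")` exercises the `*/ratings` rule, while a path such as `/blog/` matches none of the listed rules.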

Other Records

Field Value
sitemap http://s3.amazonaws.com/abroad101/sitemaps/sitemap_index.xml.gz
sitemap https://studyabroad101.com/blog/sitemap.xml.gz

Comments

  • Sign in, out links
  • country and city aliases are /country and /city; this hides duplicate (actual URL)
  • Cloudflare
  • Press page