givecampus.com
robots.txt

Robots Exclusion Standard data for givecampus.com

Resource Scan

Scan Details

Site Domain givecampus.com
Base Domain givecampus.com
Scan Status Ok
Last Scan2024-10-14T18:44:19+00:00
Next Scan 2024-11-13T18:44:19+00:00

Last Scan

Scanned2024-10-14T18:44:19+00:00
URL https://givecampus.com/robots.txt
Redirect https://www.givecampus.com/robots.txt
Redirect Domain www.givecampus.com
Redirect Base givecampus.com
Domain IPs 104.16.156.89, 104.17.6.65, 2606:4700::6810:9c59, 2606:4700::6811:641
Redirect IPs 104.16.156.89, 104.17.6.65, 2606:4700::6810:9c59, 2606:4700::6811:641
Response IP 104.17.6.65
Found Yes
Hash f4434485c9f89d890724018d5f028c1aba154941b1d195991ebe08c48705c365
SimHash bac822dde4e1

Groups

linkedinbot

Rule Path
Allow /schools/*
Allow /campaigns/*
Disallow /users/*

facebookexternalhit

Rule Path
Allow /schools/*
Allow /campaigns/*
Disallow /users/*

twitterbot

Rule Path
Allow /schools/*
Allow /campaigns/*
Disallow /users/*

*

Rule Path
Disallow /admin
Disallow /campaigns/*
Disallow /explore
Disallow /schools/*/precreate
Disallow /schools/*/onboarding
Disallow /schools/*/onboarding_done
Disallow /schools/*/partner_now
Disallow /schools/*/embed
Disallow /schools/*/endowments
Disallow /schools/*/admin
Disallow /schools/*/manage
Disallow /schools/*/archive
Disallow /schools/*/donate
Disallow /schools/*/email_templates
Disallow /schools/*/check_wepay_account
Disallow /schools/*/new_general
Disallow /schools/*/events/*/register/*
Disallow /schools/deerfieldacademy/*
Disallow /schools/IllinoisWesleyanUniversity/*
Disallow /schools/ucmerced/*
Disallow /schools/universityofcaliforniamerced/*
Disallow /schools/syracuseuniversity/*
Disallow /redactor_rails/*
Disallow /users/*
Disallow /auth/*
Disallow /gc_video/*
Disallow /schools/*/gc_video/*

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /