businesscareercollege.com
robots.txt

Robots Exclusion Standard data for businesscareercollege.com

Resource Scan

Scan Details

Site Domain businesscareercollege.com
Base Domain businesscareercollege.com
Scan Status Ok
Last Scan2024-09-23T16:18:23+00:00
Next Scan 2024-10-07T16:18:23+00:00

Last Scan

Scanned2024-09-23T16:18:23+00:00
URL https://businesscareercollege.com/robots.txt
Domain IPs 13.248.132.211, 76.223.7.231
Response IP 13.248.132.211
Found Yes
Hash d8d3ee55464a49ae3b0ad7e99df9040f2383371edc8bb0d3b5ecaa920c6a719c
SimHash f2842d8de651

Groups

*

Rule Path
Disallow /app/
Disallow /admin/
Disallow /scorm/
Disallow /orders/
Disallow /checkout/
Disallow /search/
Disallow /users/confirmation/

Other Records

Field Value
sitemap https://businesscareercollege.com/sitemap.xml

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /