newhorizons.com
robots.txt

Robots Exclusion Standard data for newhorizons.com

Resource Scan

Scan Details

Site Domain newhorizons.com
Base Domain newhorizons.com
Scan Status Ok
Last Scan2024-09-15T09:56:04+00:00
Next Scan 2024-10-15T09:56:04+00:00

Last Scan

Scanned2024-09-15T09:56:04+00:00
URL https://newhorizons.com/robots.txt
Redirect https://www.newhorizons.com/robots.txt
Redirect Domain www.newhorizons.com
Redirect Base newhorizons.com
Domain IPs 13.107.253.36
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash f899c057cc0ad3ce82a361ef08ecf0a4326e75f01be6d1ed825831acff2dcc2c
SimHash f90d195ba747

Groups

jooblebot

Rule Path
Disallow /

*

Rule Path Comment
Disallow /*/ctl/ Googlebot permits *
Disallow /*additemstocart* -
Disallow /admin/ -
Disallow /App_Browsers/ -
Disallow /App_Code/ -
Disallow /App_Data/ -
Disallow /App_GlobalResources/ -
Disallow /bin/ -
Disallow /cart/ -
Disallow /checkout/ -
Disallow /Components/ -
Disallow /Config/ -
Disallow /contest/ -
Disallow /controls/ -
Disallow /Documentation/ -
Disallow /HttpModules/ -
Disallow /Install/ -
Disallow /Providers/ -
Disallow /Activity-Feed/userId/ Do not index user profiles

Comments

  • Begin robots.txt file
  • /-----------------------------------------------\
  • | In single portal/domain situations, uncomment the sitmap line and enter domain name
  • \-----------------------------------------------/
  • End of robots.txt file