nationwidechildrens.org
robots.txt

Robots Exclusion Standard data for nationwidechildrens.org

Resource Scan

Scan Details

Site Domain nationwidechildrens.org
Base Domain nationwidechildrens.org
Scan Status Ok
Last Scan2024-10-20T14:08:02+00:00
Next Scan 2024-11-19T14:08:02+00:00

Last Scan

Scanned2024-10-20T14:08:02+00:00
URL https://nationwidechildrens.org/robots.txt
Redirect https://www.nationwidechildrens.org/robots.txt
Redirect Domain www.nationwidechildrens.org
Redirect Base nationwidechildrens.org
Domain IPs 69.24.144.75
Redirect IPs 69.24.144.75
Response IP 69.24.144.75
Found Yes
Hash fc45fa6cf8e5944735caa517325d7c1e4c50a65a83d821d976d1dd02e7dce8d9
SimHash ee50b099cc76

Groups

*

Rule Path
Disallow /sitecore
Disallow /sitecore_files
Disallow /temp
Disallow /upload
Disallow /utility
Disallow /specialties/behavioral-health/for-providers/community-behavioral-health-resource-directory/*
Disallow /home/*

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.nationwidechildrens.org/sitemap_nch.xml

Comments

  • Robots directives for NCH
  • Disallow: /css
  • Disallow: /frontend
  • Disallow: /images
  • Disallow: /js

Warnings

  • 3 invalid lines.