marshfieldclinic.org
robots.txt

Robots Exclusion Standard data for marshfieldclinic.org

Resource Scan

Scan Details

Site Domain marshfieldclinic.org
Base Domain marshfieldclinic.org
Scan Status Ok
Last Scan2024-09-13T15:23:32+00:00
Next Scan 2024-10-13T15:23:32+00:00

Last Scan

Scanned2024-09-13T15:23:32+00:00
URL https://marshfieldclinic.org/robots.txt
Domain IPs 192.236.16.216
Response IP 192.236.16.216
Found Yes
Hash 35d0221dcfa1b34a53d07d21408fde4b9f05d2d52e9be79d628cc9bba19dab47
SimHash 0d4ec8a6e013

Groups

*

Rule Path
Disallow /_layouts/
Disallow /_vti_bin/
Disallow /_catalogs/
Disallow /mcareers/pr/
Disallow /mcareers/hr/
Disallow /content/
Disallow /SiteAssets/
Disallow /education/StudentPrograms/FacultyAppointmentDocuments/

yandex

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.marshfieldclinic.org:443/sitemap.xml

Warnings

  • 1 invalid line.