gurnick.edu
robots.txt

Robots Exclusion Standard data for gurnick.edu

Resource Scan

Scan Details

Site Domain gurnick.edu
Base Domain gurnick.edu
Scan Status Ok
Last Scan 2024-09-23T15:30:57+00:00
Next Scan 2024-10-23T15:30:57+00:00

Last Scan

Scanned 2024-09-23T15:30:57+00:00
URL https://gurnick.edu/robots.txt
Domain IPs 13.52.207.110
Response IP 13.52.207.110
Found Yes
Hash 5a576eeb98aeec15724824000fdfb5d787c2cbb5ecb96de5def1f3e1d5c26bfc
SimHash e626093188c2

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-admin/admin-ajax.php
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
Disallow /calendar/action~undefined/
Disallow /calendar/action~http%3A/
Disallow /calendar/action~default/
Disallow /calendar/action~poster/
Disallow /calendar/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/
Disallow /wp-content/uploads/gravity_forms/
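
Several of the paths above contain the `*` wildcard, which crawlers that follow RFC 9309 treat as "any sequence of characters". The following is a minimal sketch of how such patterns could be checked against a request path. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are illustrative assumptions, not part of the scan data, and the sketch does not normalise percent-encoding or handle Allow rules.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/wp-admin/",
    "/wp-admin/admin-ajax.php",
    "/calendar/action~posterboard/",
    "/calendar/action~agenda/",
    "/calendar/action~oneday/",
    "/calendar/action~month/",
    "/calendar/action~week/",
    "/calendar/action~stream/",
    "/calendar/action~undefined/",
    "/calendar/action~http%3A/",
    "/calendar/action~default/",
    "/calendar/action~poster/",
    "/calendar/action~*/",
    "/*controller%3Dai1ec_exporter_controller*",
    "/*/action~*/",
    "/wp-content/uploads/gravity_forms/",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.compile(regex)


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for sample in ["/wp-admin/options.php",
                   "/events/action~week/exact_date~1/",
                   "/programs/nursing/"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, the explicit `/calendar/action~month/` style entries are already covered by `/calendar/action~*/`, which in turn is covered by `/*/action~*/`.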

petalbot

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

bubing

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

yandexmobilebot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /
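
The groups above mean that a named crawler such as petalbot or amazonbot is blocked from the whole site, while any other crawler falls back to the `*` group. A rough sketch of that group selection, using Python's standard `urllib.robotparser`, is shown below. The `ROBOTS_LINES` text is a partial reconstruction from the listing above (the original file's exact layout is not shown in this report), and `ExampleBot`, the `allowed` helper, and the `/programs/` path are hypothetical. Note that `urllib.robotparser` does simple prefix matching and does not interpret `*` inside paths, so only non-wildcard rules are included.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the rules listed above (assumed layout).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /wp-admin/",
    "Disallow: /wp-content/uploads/gravity_forms/",
    "",
    "User-agent: petalbot",
    "Disallow: /",
    "",
    "User-agent: amazonbot",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on gurnick.edu?"""
    return parser.can_fetch(agent, "https://gurnick.edu" + path)


if __name__ == "__main__":
    # Named crawlers match their own group and are blocked everywhere.
    print("PetalBot   /programs/ ->", allowed("PetalBot", "/programs/"))
    # An unlisted crawler (hypothetical name) falls back to the "*" group.
    print("ExampleBot /programs/ ->", allowed("ExampleBot", "/programs/"))
    print("ExampleBot /wp-admin/ ->", allowed("ExampleBot", "/wp-admin/"))
```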

Other Records

Field Value
sitemap /sitemap.xml