childrensnebraska.org
robots.txt

Robots Exclusion Standard data for childrensnebraska.org

Resource Scan

Scan Details

Site Domain childrensnebraska.org
Base Domain childrensnebraska.org
Scan Status Ok
Last Scan2024-05-27T06:25:59+00:00
Next Scan 2024-06-26T06:25:59+00:00

Last Scan

Scanned2024-05-27T06:25:59+00:00
URL https://childrensnebraska.org/robots.txt
Domain IPs 104.18.12.121, 104.18.13.121, 2606:4700::6812:c79, 2606:4700::6812:d79
Response IP 104.18.12.121
Found Yes
Hash 37baadfc75ae2333f371791b03a9ee4909c9627c28d35cb53013adcd77155887
SimHash eb5edf6486cb

Groups

baiduspider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

sogou head spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sogou-test-spider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /cgi-bin/
Disallow /medical-provider-images/

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap https://www.childrensnebraska.org/sitemap_index.xml