childrensomaha.org
robots.txt

Robots Exclusion Standard data for childrensomaha.org

Resource Scan

Scan Details

Site Domain childrensomaha.org
Base Domain childrensomaha.org
Scan Status Ok
Last Scan2024-05-29T16:21:03+00:00
Next Scan 2024-06-28T16:21:03+00:00

Last Scan

Scanned2024-05-29T16:21:03+00:00
URL https://childrensomaha.org/robots.txt
Redirect https://www.childrensnebraska.org/robots.txt
Redirect Domain www.childrensnebraska.org
Redirect Base childrensnebraska.org
Domain IPs 104.18.22.222, 104.18.23.222, 2606:4700::6812:16de, 2606:4700::6812:17de
Redirect IPs 104.18.12.121, 104.18.13.121, 2606:4700::6812:c79, 2606:4700::6812:d79
Response IP 104.18.13.121
Found Yes
Hash 37baadfc75ae2333f371791b03a9ee4909c9627c28d35cb53013adcd77155887
SimHash eb5edf6486cb

Groups

baiduspider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

sogou head spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sogou-test-spider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /cgi-bin/
Disallow /medical-provider-images/

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap https://www.childrensnebraska.org/sitemap_index.xml