small-whale.org
robots.txt

Robots Exclusion Standard data for small-whale.org

Resource Scan

Scan Details

Site Domain small-whale.org
Base Domain small-whale.org
Scan Status Ok
Last Scan2024-09-30T17:56:57+00:00
Next Scan 2024-10-07T17:56:57+00:00

Last Scan

Scanned2024-09-30T17:56:57+00:00
URL https://small-whale.org/robots.txt
Domain IPs 185.68.16.177, 2a00:7a60:0:10b1::1
Response IP 185.68.16.177
Found Yes
Hash a47635e0592c100859b3d0205a81ca965c6b36d9bdde57ebda951bc040d3553e
SimHash 4f205c10ff90

Groups

*

Rule Path
Disallow /wp-admin/

yandex

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow /category/
Disallow /author/
Disallow /page/
Disallow */trackback
Disallow */comments
Disallow /*.php

Other Records

Field Value
sitemap https://small-whale.org/sitemap_index.xml

Warnings

  • `host` is not a known field.