Robots Exclusion Standard data for harthaus.ru

Resource Scan

Scan Details

Site Domain harthaus.ru
Base Domain harthaus.ru
Scan Status Ok
Last Scan 4/4/2025, 9:30:49 PM
Next Scan 4/11/2025, 9:30:49 PM

Last Scan

Scanned 4/4/2025, 9:30:49 PM
URL https://harthaus.ru/robots.txt
Domain IPs 2a03:6f00:6:1::517:321b, 5.23.50.27
Response IP 5.23.50.27
Found Yes
Hash c915c363f5d46b51436315072eda114858705b5c735016fe276c9e1e63540174
SimHash 6b38ef020631

Groups

*

Rule Path
Disallow /wp-includes
Disallow /wp-feed
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /search

Other Records

Field Value
crawl-delay 1024

googlebot-image

Rule Path
Allow /wp-content/uploads/

yandeximages

Rule Path
Allow /wp-content/uploads/

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1024

Other Records

Field Value
sitemap https://harthaus.ru/sitemap.xml
sitemap https://harthaus.ru/sitemap.xml.gz
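
The groups and records listed above can be checked programmatically. The sketch below is a rough illustration, not the site's actual file: `ROBOTS_TXT` is reconstructed from this report, so its exact wording, capitalization, and group order are assumptions, and the unrecognized `host` line is left out because its value is not shown. `build_parser` and the `ExampleBot` user agent are hypothetical names. It uses Python's standard `urllib.robotparser`, which applies the first matching group and rule and may therefore differ from crawlers that use longest-match semantics.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report above (assumption): the original file's exact
# text and group order are not shown, and the `host` line is omitted.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-includes
Disallow: /wp-feed
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /search
Crawl-delay: 1024

User-agent: Googlebot-Image
Allow: /wp-content/uploads/

User-agent: YandexImages
Allow: /wp-content/uploads/

User-agent: Yandex
Crawl-delay: 1024

Sitemap: https://harthaus.ru/sitemap.xml
Sitemap: https://harthaus.ru/sitemap.xml.gz
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    base = "https://harthaus.ru"
    checks = [
        ("ExampleBot", "/search"),                    # falls back to the `*` group
        ("ExampleBot", "/wp-content/uploads/a.jpg"),  # not covered by any Disallow
        ("Googlebot-Image", "/wp-content/themes/x.css"),
        ("Yandex", "/search"),                        # group with no rules
    ]
    for agent, path in checks:
        print(agent, path, parser.can_fetch(agent, base + path))
    print("crawl-delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
    print("sitemaps:", parser.site_maps())
```

If the crawl-delay value is read as seconds, as is conventional, a delay of 1024 limits a compliant crawler to roughly 84 requests per day (86400 / 1024 ≈ 84.4). Note also that a crawler matching its own named group does not fall back to the `*` group, so in this reconstruction the `Disallow` rules do not apply to `googlebot-image`, `yandeximages`, or `yandex`.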

Warnings

  • `host` is not a known field.