freesmi.by
robots.txt

Robots Exclusion Standard data for freesmi.by

Resource Scan

Scan Details

Site Domain freesmi.by
Base Domain freesmi.by
Scan Status Ok
Last Scan2024-11-14T04:28:49+00:00
Next Scan 2024-11-21T04:28:49+00:00

Last Scan

Scanned2024-11-14T04:28:49+00:00
URL https://freesmi.by/robots.txt
Domain IPs 165.22.95.139
Response IP 165.22.95.139
Found Yes
Hash e376176445cba52d5be6ad4aedb1e4d65318005ef8468ac186ccad8fa4109fe2
SimHash 634486608375

Groups

pr-cy-bot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /xmlrpc.php
Disallow /wp-content/themes
Disallow /trackback
Disallow */trackback
Disallow */*/trackback
Disallow /*?*
Disallow /tag
Disallow /lydi
Disallow /temi
Disallow /strani
Disallow /istochniki
Disallow /organizacii
Disallow /regiony
Disallow /sample-page
Disallow */page/*

Other Records

Field Value
crawl-delay 5

yandex

Rule Path
Allow /yandex/news
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /xmlrpc.php
Disallow /wp-content/themes
Disallow /trackback
Disallow */trackback
Disallow */*/trackback
Disallow /*?*
Disallow /tag
Disallow /lydi
Disallow /temi
Disallow /strani
Disallow /istochniki
Disallow /organizacii
Disallow /regiony
Disallow /sample-page
Disallow */page/*

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://freesmi.by/sitemap_index.xml

Warnings

  • `host` is not a known field.