ilenta.com
robots.txt

Robots Exclusion Standard data for ilenta.com

Resource Scan

Scan Details

Site Domain ilenta.com
Base Domain ilenta.com
Scan Status Ok
Last Scan2024-11-11T10:18:32+00:00
Next Scan 2024-11-18T10:18:32+00:00

Last Scan

Scanned2024-11-11T10:18:32+00:00
URL https://ilenta.com/robots.txt
Domain IPs 89.184.76.249
Response IP 89.184.76.249
Found Yes
Hash 629a5a50b59d438df74f1739b124472bc71e2cf2dfdff903d3c0afef51f1ae72
SimHash 4a481ac65433

Groups

*

Rule Path
Allow /
Disallow /netcat/*
Disallow /go.php?*
Disallow /news/?curPos*

mediapartners-google

Rule Path
Allow /
Disallow /netcat/*

yandex

Rule Path
Allow /
Disallow /netcat/*
Disallow /news/news.rss
Disallow /tags/*
Disallow /news/?curPos*
Disallow /go.php?*

Other Records

Field Value
sitemap https://ilenta.com/sitemap/news/
sitemap https://ilenta.com/sitemap/applications/
sitemap https://ilenta.com/sitemap/ps/
sitemap https://ilenta.com/sitemap/
sitemap https://ilenta.com/uk/sitemap/news/

Warnings

  • `host` is not a known field.