intersmi.news
robots.txt

Robots Exclusion Standard data for intersmi.news

Resource Scan

Scan Details

Site Domain intersmi.news
Base Domain intersmi.news
Scan Status Ok
Last Scan 2026-03-25T19:02:24+00:00
Next Scan 2026-04-24T19:02:24+00:00

Last Scan

Scanned 2026-03-25T19:02:24+00:00
URL https://intersmi.news/robots.txt
Domain IPs 104.21.10.108, 172.67.131.115, 2606:4700:3033::6815:a6c, 2606:4700:3033::ac43:8373
Response IP 172.67.131.115
Found Yes
Hash 757058730dd5562cd9fa28a1861bac8f845e6488fbcc8d3d1b48600c3d32e723
SimHash 631117610724

Groups

yandex

Rule Path
Allow /?feed=news.yandex.ru
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/themes
Disallow /wp-trackback
Disallow /wp-feed
Disallow */trackback
Disallow /?feed=*
Disallow /author
Disallow /xmlrpc.php
Disallow /tag
Disallow /theme
Disallow /*feed
Disallow /*rss
Disallow /*comments
Disallow /?m=*
Disallow /?s=*
Allow /wp-content/uploads/

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/themes
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow */trackback
Disallow /?feed=*
Disallow /xmlrpc.php
Disallow /tag
Disallow /theme
Disallow /*feed
Disallow /*rss
Disallow /author
Disallow /*comments
Disallow /?m=*
Disallow /?s=*
Allow /wp-content/uploads/
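
The two groups above can be checked against concrete paths with a small matcher. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) with `*` as a wildcard and `$` as an end anchor. The scanner's own evaluation logic is not shown in this report, and individual crawlers may differ. The names `STAR_RULES`, `YANDEX_RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, not part of any library.

```python
import re

# Rules copied from the "*" group listed above.
STAR_RULES = [
    ("disallow", "/cgi-bin"), ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"), ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/themes"), ("disallow", "/wp-trackback"),
    ("disallow", "/wp-feed"), ("disallow", "/wp-comments"),
    ("disallow", "*/trackback"), ("disallow", "/?feed=*"),
    ("disallow", "/xmlrpc.php"), ("disallow", "/tag"),
    ("disallow", "/theme"), ("disallow", "/*feed"),
    ("disallow", "/*rss"), ("disallow", "/author"),
    ("disallow", "/*comments"), ("disallow", "/?m=*"),
    ("disallow", "/?s=*"), ("allow", "/wp-content/uploads/"),
]

# The "yandex" group has the same rules minus /wp-comments, plus one Allow.
YANDEX_RULES = [("allow", "/?feed=news.yandex.ru")] + [
    rule for rule in STAR_RULES if rule != ("disallow", "/wp-comments")
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text, then let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path matched by no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/wp-admin/options.php", STAR_RULES))
print(is_allowed("/wp-content/uploads/rss-icon.png", STAR_RULES))
print(is_allowed("/?feed=news.yandex.ru", STAR_RULES))
print(is_allowed("/?feed=news.yandex.ru", YANDEX_RULES))
```

Under these assumptions the only practical difference between the groups is the Yandex news feed URL: the longer `Allow /?feed=news.yandex.ru` pattern overrides `Disallow /?feed=*` and `Disallow /*feed` for the `yandex` group, while the `*` group has no such exception.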

Other Records

Field Value
sitemap http://intersmi.news/sitemap.xml
sitemap http://intersmi.news/sitemap-news.xml

Warnings

  • `host` is not a known field.
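
A warning like this one can be reproduced by checking each field name in a robots.txt file against a list of recognized names. The sketch below is only an illustration: the `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not given in this report), the `unknown_fields` helper is hypothetical, and the `SAMPLE` text uses a placeholder `Host` value because the real line from the site's file does not appear above.

```python
# Assumed set of recognized field names; the scanner's real list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt):
    """Return unrecognized field names in order of first appearance."""
    seen = []
    for raw in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        # Field names are case-insensitive; split on the first colon only
        # so that URL values such as "http://..." stay intact.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical input: the Host value is a placeholder, not the site's own.
SAMPLE = """\
User-agent: *
Disallow: /wp-admin
Host: example.invalid
Sitemap: http://intersmi.news/sitemap.xml
"""

print(unknown_fields(SAMPLE))  # ['host']
```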