24smi.org
robots.txt

Robots Exclusion Standard data for 24smi.org

Resource Scan

Scan Details

Site Domain 24smi.org
Base Domain 24smi.org
Scan Status Ok
Last Scan 2024-11-13T09:12:43+00:00
Next Scan 2024-11-20T09:12:43+00:00

Last Scan

Scanned 2024-11-13T09:12:43+00:00
URL https://24smi.org/robots.txt
Domain IPs 104.22.32.210, 104.22.33.210, 172.67.12.90, 2606:4700:10::6816:20d2, 2606:4700:10::6816:21d2, 2606:4700:10::ac43:c5a
Response IP 104.22.33.210
Found Yes
Hash 3c14278e156d14583d1ae77367dc16d0c428e501f7d7539064fe09e9e44a6770
SimHash 2dcd2e65af93
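
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a minimal sketch for fingerprinting a fetched robots.txt body and detecting changes between scans might look like this (the function name `content_digest` is hypothetical):

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return content_digest(body) != previous_digest
```

If the scanner normalizes the body (for example line endings or encoding) before hashing, a digest computed this way would not match the reported value.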

Groups

yandexbot

Rule Path
Disallow *.swf
Disallow /top/
Disallow /news/embed/
Disallow /en/celebrity/tag/
Disallow /celebrity/search/
Disallow /promocode/shops/search/

googlebot

Rule Path
Disallow *.swf
Disallow /top/
Disallow /news/embed/
Disallow /en/celebrity/tag/
Disallow /celebrity/search/
Disallow /promocode/shops/search/

*

Rule Path
Disallow *.swf
Disallow /top/
Disallow /news/embed/
Disallow /en/celebrity/tag/
Disallow /celebrity/search/
Disallow /promocode/shops/search/
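
All three groups above carry the same six Disallow rules, so a single matcher covers them. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming the common robots.txt conventions: rules match from the start of the path, `*` matches any sequence of characters, and a trailing `$` anchors the end. The function names and the sample paths in the usage note are illustrative, not taken from the site.

```python
import re

# Disallow rules listed in the report (identical for yandexbot, googlebot, and *).
DISALLOW = [
    "*.swf",
    "/top/",
    "/news/embed/",
    "/en/celebrity/tag/",
    "/celebrity/search/",
    "/promocode/shops/search/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where the rule had "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow rule matches the path from its start."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/media/player.swf")` and `is_disallowed("/top/2024/")` return True, while `is_disallowed("/celebrity/")` returns False. Because the report lists no Allow rules, the sketch skips the longest-match precedence that a full parser would need.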

Other Records

Field Value
sitemap https://24smi.org/sitemap.xml
sitemap https://24smi.org/image-sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.