topcafe.su
robots.txt

Robots Exclusion Standard data for topcafe.su

Resource Scan

Scan Details

Site Domain topcafe.su
Base Domain topcafe.su
Scan Status Ok
Last Scan 2025-11-25T09:18:54+00:00
Next Scan 2025-12-02T09:18:54+00:00

Last Scan

Scanned 2025-11-25T09:18:54+00:00
URL https://topcafe.su/robots.txt
Domain IPs 87.236.16.74
Response IP 87.236.16.74
Found Yes
Hash 0bc8e28f05ab6dc37d43c992a5d4e2515e8684239a0ac9ac120730e1f7944f2a
SimHash 4f2055540638

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /xmlrpc.php
Disallow /search
Disallow */trackback/
Disallow */feed
Disallow *?*
Disallow */attachment/*
Disallow /author/
Disallow */category/*
Disallow */print/
Disallow *?print=*
Disallow /wp-json*
Disallow */page/
Disallow *possible__unsafe__site*
Allow /wp-content/uploads/
Allow /wp-content/plugins/*/*?ver*
Allow /wp-includes/js/jquery/*?ver*
Allow /wp-content/themes/*?ver*
Allow /wp-content/themes/*/img/
Allow /wp-includes/js/*?ver=
Allow */feed/zen
Allow */feed/mihdan-mailru-pulse-feed/

Other Records

Field Value
crawl-delay 5
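
The `*` group mixes broad Disallow patterns with narrower Allow exceptions, so which rule applies to a given path depends on how the crawler resolves overlaps. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). Individual crawlers may resolve conflicts differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and only a subset of the rules above is included.

```python
import re

# Subset of the "*" group rules listed above (copied from the scan).
RULES = [
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "*?*"),
    ("disallow", "*/feed"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-includes/js/jquery/*?ver*"),
    ("allow", "*/feed/zen"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is treated literally.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/blog/feed")` is false because only `*/feed` matches, while `is_allowed("/blog/feed/zen")` is true because the longer `*/feed/zen` Allow pattern overrides it.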

googlebot-image

Rule Path
Allow /wp-content/uploads/

yandeximages

Rule Path
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Disallow

yadirectbot

Rule Path
Disallow

Other Records

Field Value
sitemap https://topcafe.su/sitemap.xml

Warnings

  • `host` is not a known field.