mezha.media
robots.txt

Robots Exclusion Standard data for mezha.media

Resource Scan

Scan Details

Site Domain mezha.media
Base Domain mezha.media
Scan Status Ok
Last Scan2024-11-11T01:34:34+00:00
Next Scan 2024-11-18T01:34:34+00:00

Last Scan

Scanned2024-11-11T01:34:34+00:00
URL https://mezha.media/robots.txt
Domain IPs 104.26.8.81, 104.26.9.81, 172.67.74.13, 2606:4700:20::681a:851, 2606:4700:20::681a:951, 2606:4700:20::ac43:4a0d
Response IP 172.67.74.13
Found Yes
Hash cf7d8e6dc2d6007cd3223ab0fb55c97dad9861a6a81887b2fdae96578735f47d
SimHash 6d7511759492

Groups

*

Rule Path
Disallow /my-dir/
Disallow /cgi-bin/
Disallow /cdn-cgi/
Disallow */feed/$
Disallow /wp-admin/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow */embed$
Disallow */page/
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D

googlebot

Rule Path
Disallow /my-dir/
Disallow /cgi-bin/
Disallow /cdn-cgi/
Disallow */feed/$
Disallow /wp-admin/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow */embed$
Disallow */page/
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /wp-content/gulp-backups/

Other Records

Field Value
sitemap https://mezha.media/sitemap_index.xml
sitemap https://mezha.media/en/sitemap_index.xml
sitemap https://mezha.media/news-sitemap.xml
sitemap https://mezha.media/video-sitemap.xml