glavpost.com
robots.txt

Robots Exclusion Standard data for glavpost.com

Resource Scan

Scan Details

Site Domain glavpost.com
Base Domain glavpost.com
Scan Status Ok
Last Scan2024-09-17T00:05:46+00:00
Next Scan 2024-09-24T00:05:46+00:00

Last Scan

Scanned2024-09-17T00:05:46+00:00
URL https://glavpost.com/robots.txt
Response IP 5.45.115.149
Found Yes
Hash 1f3488640975a2c585b5dd56ee18cc372708d1ed3a9bd8c67e4f8fdc661b90f0
SimHash 7d31ab30d5f1

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /wp/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed$
Disallow */page/
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Allow */wp-*/*ajax*.php
Allow */sitemap.xml
Allow */uploads
Allow */wp-*/*.js
Allow */wp-*/*.css
Allow */wp-*/*.png
Allow */wp-*/*.jpg
Allow */wp-*/*.jpeg
Allow */wp-*/*.gif
Allow */wp-*/*.svg
Allow */wp-*/*.webp
Allow */wp-*/*.swf
Allow */wp-*/*.pdf

Other Records

Field Value
sitemap /sitemap.xml