manas.news
robots.txt

Robots Exclusion Standard data for manas.news

Resource Scan

Scan Details

Site Domain manas.news
Base Domain manas.news
Scan Status Ok
Last Scan 2024-08-26T08:09:07+00:00
Next Scan 2024-09-25T08:09:07+00:00

Last Scan

Scanned 2024-08-26T08:09:07+00:00
URL https://manas.news/robots.txt
Domain IPs 172.66.40.184, 172.66.43.72, 2606:4700:3108::ac42:28b8, 2606:4700:3108::ac42:2b48
Response IP 172.66.40.184
Found Yes
Hash 060022234dd2ebdd375645be7735ace7082e0a6737651f6e2ef33f495982a702
SimHash 25928a720278
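
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming the digest is taken over the raw response body; the function name `body_fingerprint` and the sample bytes are hypothetical:

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes; the report does not say so.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by manas.news.
sample = b"User-agent: *\nDisallow: /cgi-bin\n"
print(body_fingerprint(sample))
```

Comparing digests from two scans is a cheap way to detect that the file changed, whatever the exact algorithm turns out to be.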

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow */trackback
Disallow */embed
Disallow /xmlrpc.php
Disallow *utm%3D
Disallow *openstat%3D
Disallow /readme.html
Disallow *?replytocom
Disallow /cdn-cgi/
Allow */uploads

googlebot

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow */trackback
Disallow */embed
Disallow /xmlrpc.php
Disallow *utm%3D
Disallow *openstat%3D
Disallow /readme.html
Disallow *?replytocom
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-admin/admin-ajax.php
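
The googlebot group mixes a broad `Disallow /wp-` with narrower `Allow` rules, so the outcome for a path depends on which rule takes precedence. A minimal sketch of the longest-match convention described in RFC 9309 (the most specific matching rule wins, and a tie goes to `Allow`), using a subset of the rules above. The helper names and the test paths are hypothetical, and real crawlers may differ in details:

```python
import re


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Apply longest-match precedence; paths with no matching rule are allowed."""
    best = None  # (pattern length, is_allow); tuple order lets Allow win ties
    for verb, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), verb == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# Subset of the googlebot group listed above.
GOOGLEBOT_RULES = [
    ("disallow", "/?"),
    ("disallow", "/wp-"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

for path in ("/wp-login.php", "/wp-content/themes/x/logo.png", "/?p=1"):
    print(path, is_allowed(GOOGLEBOT_RULES, path))
```

Under these assumptions `/wp-login.php` is blocked by `/wp-`, while a PNG under `/wp-content/` is permitted because `/wp-*.png` is the longer matching pattern.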

yandex

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow */trackback
Disallow */embed
Disallow /xmlrpc.php
Disallow /readme.html
Disallow *?replytocom
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-admin/admin-ajax.php

sputnñ–kð’ð¾t

Rule Path
Disallow /

slurñ€

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /
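
Because the file defines several groups, a crawler first has to decide which one applies to it. A minimal sketch, assuming the RFC 9309 convention that a crawler uses the group whose name matches its product token case-insensitively and otherwise falls back to `*`. The function name `select_group`, the abbreviated rule lists, and the token `examplebot` are hypothetical:

```python
def select_group(groups, product_token):
    """Return the rules for the group matching product_token, else the '*' group."""
    token = product_token.lower()
    for name, rules in groups.items():
        if name.lower() == token:
            return rules
    return groups.get("*", [])


# Abbreviated from the groups listed in this report.
GROUPS = {
    "*": [("disallow", "/cgi-bin"), ("disallow", "/wp-"), ("allow", "*/uploads")],
    "bingbot": [("disallow", "/")],
    "duckduckbot": [("disallow", "/")],
}

print(select_group(GROUPS, "Bingbot"))     # bingbot's own group: everything disallowed
print(select_group(GROUPS, "examplebot"))  # no dedicated group, so the '*' rules apply
```

A crawler with a dedicated group ignores the `*` rules entirely, which is why bingbot, baiduspider and duckduckbot are fully excluded here even though `*` is more permissive.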

Warnings

  • 2 invalid lines.
  • `clean-param` is not a known field.