istmira.com
robots.txt

Robots Exclusion Standard data for istmira.com

Resource Scan

Scan Details

Site Domain istmira.com
Base Domain istmira.com
Scan Status Ok
Last Scan2024-10-31T19:00:48+00:00
Next Scan 2024-11-07T19:00:48+00:00

Last Scan

Scanned2024-10-31T19:00:48+00:00
URL https://istmira.com/robots.txt
Redirect http://www.istmira.com/robots.txt
Redirect Domain www.istmira.com
Redirect Base istmira.com
Domain IPs 168.119.91.88
Redirect IPs 168.119.91.88
Response IP 168.119.91.88
Found Yes
Hash 61e9d9069dd4bdd3141f12b52bd2d89f2310bccc4651dba9d46a9ec52435fb7d
SimHash 754d9871a031

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /page/
Disallow /lastnews/
Disallow /tags/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo
Disallow /engine/download.php
Disallow /print/
Disallow /index.php?do=search
Disallow /2010/
Disallow /2011/
Disallow /2012/
Disallow /2013/
Disallow /2014/
Disallow /2015/
Disallow /2016/
Disallow /2017/
Disallow /2018/
Disallow /2019/
Disallow /category/
Disallow /*.php$
Disallow /*?*

Warnings

  • `host` is not a known field.