mix.com.tj
robots.txt

Robots Exclusion Standard data for mix.com.tj

Resource Scan

Scan Details

Site Domain mix.com.tj
Base Domain mix.com.tj
Scan Status Ok
Last Scan2025-03-13T01:51:48+00:00
Next Scan 2025-03-20T01:51:48+00:00

Last Scan

Scanned2025-03-13T01:51:48+00:00
URL https://mix.com.tj/robots.txt
Redirect https://mix.tj/robots.txt
Redirect Domain mix.tj
Redirect Base mix.tj
Domain IPs 217.11.180.61
Redirect IPs 217.11.180.61
Response IP 217.11.180.61
Found Yes
Hash 4aaabfb400ef7e5494a132a97989f7d5f3649fd67551e84d4be3d8b62e61ec95
SimHash 520fe96042f0

Groups

*

Rule Path
Disallow /*do%3Dpoisk
Disallow /*do%3Dlostpassword
Disallow /play/
Disallow /embed/
Disallow /poisk/
Disallow /cat/anime*
Disallow /cat/amv*
Disallow /go/last?cat=amv*
Disallow /go/last?cat=anime*

imagesiftbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

keys-so-bot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

detectify

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

ccbot

Rule Path
Disallow /