turkru.love
robots.txt

Robots Exclusion Standard data for turkru.love

Resource Scan

Scan Details

Site Domain turkru.love
Base Domain turkru.love
Scan Status Ok
Last Scan2024-09-21T05:12:06+00:00
Next Scan 2024-09-28T05:12:06+00:00

Last Scan

Scanned2024-09-21T05:12:06+00:00
URL https://turkru.love/robots.txt
Domain IPs 188.116.26.253
Response IP 188.116.26.253
Found Yes
Hash e8b809bca75964ffd0ef2113d9716a6b0d32d7837dd30e412a62afd80ea14d41
SimHash d90dc4424539

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /engine/download.php
Disallow /user/*
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /xfsearch/
Disallow /print
Disallow /*print%3A*
Disallow /*do%3Dorderdesc
Disallow /*do%3Dauth-social
Disallow /user/favorites/
Disallow /favorites/
Disallow */favorites/
Disallow /favorites/*
Disallow */*%3Dorderdesc
Disallow */%3Dorderdesc
Disallow /index.php?do=orderdesc
Disallow /page/*
Disallow */?*

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://turkru.love/sitemap.xml

Warnings

  • `host` is not a known field.