turkru.uk
robots.txt

Robots Exclusion Standard data for turkru.uk

Resource Scan

Scan Details

Site Domain turkru.uk
Base Domain turkru.uk
Scan Status Ok
Last Scan2024-11-05T10:43:37+00:00
Next Scan 2024-11-12T10:43:37+00:00

Last Scan

Scanned2024-11-05T10:43:37+00:00
URL https://turkru.uk/robots.txt
Domain IPs 104.21.1.38, 172.67.152.40, 2606:4700:3035::6815:126, 2606:4700:3037::ac43:9828
Response IP 172.67.152.40
Found Yes
Hash 7da12a2989e89fb5429a296a8c8d71cce3cf820de6e9b6f945beef54e80ae5b4
SimHash c9098c52c433

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /engine/download.php
Disallow /user/*
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /xfsearch/
Disallow /?jhm
Disallow /print
Disallow /*print%3A*
Disallow /*do%3Dorderdesc
Disallow /*do%3Dauth-social
Disallow /user/favorites/
Disallow /f/
Disallow /f/*
Disallow /favorites/
Disallow */favorites/
Disallow /favorites/*
Disallow */*%3Dorderdesc
Disallow */%3Dorderdesc
Disallow /index.php?do=orderdesc
Disallow /page/*
Disallow */?*

Other Records

Field Value
sitemap https://turkru.uk/sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.