turkru.nl
robots.txt

Robots Exclusion Standard data for turkru.nl

Resource Scan

Scan Details

Site Domain turkru.nl
Base Domain turkru.nl
Scan Status Ok
Last Scan2024-11-08T15:56:57+00:00
Next Scan 2024-11-15T15:56:57+00:00

Last Scan

Scanned2024-11-08T15:56:57+00:00
URL https://turkru.nl/robots.txt
Domain IPs 104.21.14.208, 172.67.160.143, 2606:4700:3035::6815:ed0, 2606:4700:3037::ac43:a08f
Response IP 104.21.14.208
Found Yes
Hash bc04ea09751871d8a064bb1efe650b48dd2b38fc6945f01bf9d202d1adc09eab
SimHash c908ac52c1b3

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /engine/download.php
Disallow /user/*
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /xfsearch/
Disallow /?jhm
Disallow /print
Disallow /*print%3A*
Disallow /*do%3Dorderdesc
Disallow /*do%3Dauth-social
Disallow /user/favorites/
Disallow /f/
Disallow /f/*
Disallow /favorites/
Disallow */favorites/
Disallow /favorites/*
Disallow */*%3Dorderdesc
Disallow */%3Dorderdesc
Disallow /index.php?do=orderdesc
Disallow /page/*
Disallow */?*

Other Records

Field Value
sitemap https://turkru.nl/sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.