turkru.land
robots.txt

Robots Exclusion Standard data for turkru.land

Resource Scan

Scan Details

Site Domain turkru.land
Base Domain turkru.land
Scan Status Ok
Last Scan2024-09-02T00:35:42+00:00
Next Scan 2024-10-02T00:35:42+00:00

Last Scan

Scanned2024-09-02T00:35:42+00:00
URL https://turkru.land/robots.txt
Domain IPs 104.21.16.22, 172.67.165.248, 2606:4700:3031::ac43:a5f8, 2606:4700:3032::6815:1016
Response IP 172.67.165.248
Found Yes
Hash 283fbdb6a589bf94752f4433220538a7dd3e2ab874c792c44bd842e7183abad3
SimHash fd09a4788721

Groups

*

Rule Path
Allow /engine/classes/min/*
Allow /engine/data/emoticons/*
Disallow /engine/
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo

Other Records

Field Value
sitemap https://turkru.land/sitemap.xml

Warnings

  • `host` is not a known field.