turoktv.com
robots.txt

Robots Exclusion Standard data for turoktv.com

Resource Scan

Scan Details

Site Domain turoktv.com
Base Domain turoktv.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-03-10T02:12:26+00:00
Next Scan 2024-06-08T02:12:26+00:00

Last Successful Scan

Scanned2023-05-11T10:32:34+00:00
URL https://turoktv.com/robots.txt
Domain IPs 104.21.49.217, 172.67.194.198, 2606:4700:3034::ac43:c2c6, 2606:4700:3036::6815:31d9
Response IP 104.21.49.217
Found Yes
Hash 9889a2b12e509d593109a5fcab801e796080a216e3dd3f9e320a343b032d811c
SimHash 5109c42081b3

Groups

*

Rule Path
Disallow /user/
Disallow /newposts/
Disallow /lastnews/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /index.php?
Disallow /page%2C
Disallow */page%2C*
Disallow *%7Blostpassword-link%7D*
Disallow /print
Disallow */print%2C*
Disallow /print
Disallow /print%3A
Disallow /search.html
Disallow *?q=

Other Records

Field Value
sitemap https://turoktv.com/sitemap.xml

Comments

  • Disallow: /engine/
  • Disallow: /page/
  • Disallow: */page/*

Warnings

  • `host` is not a known field.