truecrime.guru
robots.txt

Robots Exclusion Standard data for truecrime.guru

Resource Scan

Scan Details

Site Domain truecrime.guru
Base Domain truecrime.guru
Scan Status Ok
Last Scan2024-09-22T10:12:55+00:00
Next Scan 2024-09-29T10:12:55+00:00

Last Scan

Scanned2024-09-22T10:12:55+00:00
URL https://truecrime.guru/robots.txt
Redirect http://www.truecrime.guru/robots.txt
Redirect Domain www.truecrime.guru
Redirect Base truecrime.guru
Domain IPs 45.83.192.72
Redirect IPs 45.83.192.72
Response IP 45.83.192.72
Found Yes
Hash 05e714d6ec6b28c15f2a2427f77cfcbb3026941f782de852a2a4f476cf75fe15
SimHash 0b444a5005b4

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Allow /$
Allow /*sitemap
Allow /*rss
Allow /*type%3Drss
Allow /*board
Allow /*topic
Disallow /attachments/
Disallow /cache/
Disallow /avatars/
Disallow /index.php/board%2C33.0.html
Disallow /Packages/
Disallow /Smileys/
Disallow /Sources/
Disallow /Themes/
Disallow /Games/
Disallow /*sort
Disallow /*topicseen
Disallow /*wap
Disallow /*wap2
Disallow /*imode
Disallow /*action
Disallow /index.php?*action=
Disallow /*prev_next
Disallow /*all
Disallow /*PHPSESSID
Disallow /*%3B
Disallow /*ID

twiceler

Rule Path
Disallow /

w3c-checklink

Rule Path
Disallow /

yandeximageresizer

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.truecrime.guru/sitemap.xml

Warnings

  • `host` is not a known field.