wh40k.lexicanum.com
robots.txt

Robots Exclusion Standard data for wh40k.lexicanum.com

Resource Scan

Scan Details

Site Domain wh40k.lexicanum.com
Base Domain lexicanum.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-25T02:59:23+00:00
Next Scan 2024-06-24T02:59:23+00:00

Last Successful Scan

Scanned2024-04-02T23:21:31+00:00
URL https://wh40k.lexicanum.com/robots.txt
Domain IPs 104.26.14.60, 104.26.15.60, 172.67.74.157, 2606:4700:20::681a:e3c, 2606:4700:20::681a:f3c, 2606:4700:20::ac43:4a9d
Response IP 104.26.15.60
Found Yes
Hash e38f4f2b5a144bac7e25650693097c0b014fc7f77ddb01fd6db8bf9c0bf17dbe
SimHash ef0e5ca04174

Groups

*

Rule Path
Disallow /mediawiki/extensions
Disallow /mediawiki/maintenance
Disallow /mediawiki/includes
Disallow /mediawiki/docs
Disallow /mediawiki/bin
Disallow /mediawiki/cache
Disallow /mediawiki/languages
Disallow /mediawiki/mw-config
Disallow /mediawiki/serialized
Disallow /mediawiki/resources
Disallow /mediawiki/skins.bak

Other Records

Field Value
crawl-delay 5

turnitinbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

simplepie

Rule Path
Disallow /

speedy

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /