thewikihow.com
robots.txt

Robots Exclusion Standard data for thewikihow.com

Resource Scan

Scan Details

Site Domain thewikihow.com
Base Domain thewikihow.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-18T14:22:12+00:00
Next Scan 2024-11-16T14:22:12+00:00

Last Successful Scan

Scanned2023-04-26T21:41:50+00:00
URL https://thewikihow.com/robots.txt
Domain IPs 95.216.142.131
Response IP 95.216.142.131
Found Yes
Hash 578b865a4b797191a8a00dd5c331a02f5eee19a3a6442d5487f494fd0b29f45f
SimHash 4e1fdff07213

Groups

mauibot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

charlotte

Rule Path
Disallow /

linguatools

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

detectify

Rule Path
Disallow /

riddler

Rule Path
Disallow /

speedy

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ocelli

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

riddlerbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

yandex

Rule Path
Disallow /amp*

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 0.5

Warnings

  • 2 invalid lines.
  • `clean-param` is not a known field.
  • `host` is not a known field.