w140.com
robots.txt

Robots Exclusion Standard data for w140.com

Resource Scan

Scan Details

Site Domain w140.com
Base Domain w140.com
Scan Status Ok
Last Scan 2025-11-19T06:34:15+00:00
Next Scan 2025-12-19T06:34:15+00:00

Last Scan

Scanned 2025-11-19T06:34:15+00:00
URL https://w140.com/robots.txt
Domain IPs 104.21.67.221, 172.67.182.12, 2606:4700:3031::ac43:b60c, 2606:4700:3037::6815:43dd
Response IP 104.21.67.221
Found Yes
Hash 7dbb1b2e40d1b15d3f364266b485ab85f238b1b8f525dbff8a7ae419a619a5ce
SimHash d0544142eaa3

Groups

*

Rule Path
Disallow /tekwiki/wiki/Special%3A*
Disallow /tekwiki/index.php?title=Special%3A*

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow *

blexbot/1.0

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

dnyzbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

haosouspider
sogou spider
yisouspider

Rule Path
Disallow /

mail.ru
mail.ru_bot/2.0

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

yandex

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /
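
The groups above can be checked programmatically. The following is a minimal sketch, not part of the scan itself, that uses Python's standard `urllib.robotparser` on a small excerpt rebuilt from two of the full-site groups listed above. The excerpt text, the `is_allowed` helper, and the test URLs are illustrative assumptions; the real file at https://w140.com/robots.txt may be formatted differently. The wildcard rules in the `*` group are left out on purpose, because the standard-library parser compares paths by literal prefix and is not expected to expand `*` patterns.

```python
from urllib.robotparser import RobotFileParser

# Excerpt rebuilt from the scan's group listing (assumed formatting,
# not a copy of the live file). Only full-site "Disallow: /" groups
# are included; wildcard paths are omitted.
ROBOTS_EXCERPT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_text: str = ROBOTS_EXCERPT) -> bool:
    """Return True if user_agent may fetch url under the given rules.

    Hypothetical helper for illustration; it parses the rules from a
    string so no network request is made.
    """
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("AhrefsBot", "SemrushBot", "ExampleCrawler"):
        print(agent, is_allowed(agent, "https://w140.com/tekwiki/"))
```

In this excerpt a crawler with no matching group (here the made-up `ExampleCrawler`) is treated as allowed, because the `*` group was not included.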

Warnings

  • 1 invalid line.