webtwister.com
robots.txt

Robots Exclusion Standard data for webtwister.com

Resource Scan

Scan Details

Site Domain webtwister.com
Base Domain webtwister.com
Scan Status Ok
Last Scan2025-04-28T23:01:06+00:00
Next Scan 2025-05-28T23:01:06+00:00

Last Scan

Scanned2025-04-28T23:01:06+00:00
URL https://webtwister.com/robots.txt
Domain IPs 192.185.108.132
Response IP 192.185.108.132
Found Yes
Hash 3bc4b331011a1ada142a66cd8515758d9c8827d4874ba85d9bd6daa4f14cd14d
SimHash 200683806054

Groups

newscan-online
ecollector
cmc/0.01
googlebot-image

Rule Path
Disallow /

*
atn_worldwide
scooter
grabber
anzwerscrawl
architextspider
fast-webcrawler
googlebot
fido
slurp
lycos_spider_(t-rex)
gulliver
t-h-u-n-d-e-r-s-t-o-n-e
internet cruiser robot
topiclink
jcrawler
whowhere
winona

Rule Path
Disallow /Templates/
Disallow /cgi-bin/
Disallow /controlpanel/
Disallow /dsm/
Disallow /images/
Disallow /includes/
Disallow /stats/

Comments

  • NO access (newscan, e-collector, CMC/0.01, Google Image)
  • PARTIAL access (All Spiders, AllThatNet, Alta Vista, Direct Hit Grabber, Anzwers, Excite, FAST/AllTheWeb, Google, PlanetSearch, Inktomi, Lycos, Northern Light, Thunderstone, whatUseek, Internet Cruiser, TopicLink, VietGATE, WhoWhere?)