webdianoia.com
robots.txt

Robots Exclusion Standard data for webdianoia.com

Resource Scan

Scan Details

Site Domain webdianoia.com
Base Domain webdianoia.com
Scan Status Ok
Last Scan2024-09-20T14:11:48+00:00
Next Scan 2024-09-27T14:11:48+00:00

Last Scan

Scanned2024-09-20T14:11:48+00:00
URL https://webdianoia.com/robots.txt
Domain IPs 203.161.53.96
Response IP 203.161.53.96
Found Yes
Hash c2361800cc92c8b553e9708a304c1aae770dedd014ff222be1ac3ee99a9b097f
SimHash ad47700c6bc1

Groups

httrack website <span class="il_ad" id="il_ad1">copier</span>

Rule Path
Disallow /

wget

Rule Path
Disallow /

ecatch

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

surfoffline

Rule Path
Disallow /

websiteripper

Rule Path
Disallow /

web2disk

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

acrobat\ webcapture

Rule Path
Disallow /

web dumper

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /blog/
Disallow /buscar/
Disallow /Templates/