archwired.com
robots.txt

Robots Exclusion Standard data for archwired.com

Resource Scan

Scan Details

Site Domain archwired.com
Base Domain archwired.com
Scan Status Ok
Last Scan2024-11-15T19:57:42+00:00
Next Scan 2024-11-22T19:57:42+00:00

Last Scan

Scanned2024-11-15T19:57:42+00:00
URL https://archwired.com/robots.txt
Domain IPs 50.31.114.12
Response IP 50.31.114.12
Found Yes
Hash 09486e37d6490099794658c68e04852ff9ffbef50ab6782bb67f8d63c4eebcaa
SimHash 2014426a5493

Groups

googlebot

Rule Path
Allow /ads.txt

googlebot-image

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

scrubby

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

asterias

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow

aboundex

Rule Path
Disallow /

ahrefs

Rule Path
Disallow /

amazon_ec2_n_america

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-ads

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

bixolabs

Rule Path
Disallow /

brandimensionsblock

Rule Path
Disallow /

butterfly

Rule Path
Disallow /

compspybot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotnetdotcom

Rule Path
Disallow /

exaleadcloudviewcrawler

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

exabot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

geohasher

Rule Path
Disallow /

hetzneronlineblock

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

indy library

Rule Path
Disallow /

jakarta commons

Rule Path
Disallow /

java

Rule Path
Disallow /

jskitboturlresolver

Rule Path
Disallow /

larbin

Rule Path
Disallow /

linkdex.com

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lmqueuebot

Rule Path
Disallow /

lynx

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mrsputnik

Rule Path
Disallow /

netseer

Rule Path
Disallow /

news bot

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

ning

Rule Path
Disallow /

nutch

Rule Path
Disallow /

obot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

paloaltonetworksblock

Rule Path
Disallow /

panscient

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

scrapebox

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrixcrawler

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

tweetmemebot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yeti

Rule Path
Disallow /

add catalog

Rule Path
Disallow /

buibui-bot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

exalead

Rule Path
Disallow /

fastbot crawler

Rule Path
Disallow /

flightdeckreportsbot

Rule Path
Disallow /

iltrovatore-setaccio

Rule Path
Disallow /

ip-web-crawler.com

Rule Path
Disallow /

linguee

Rule Path
Disallow /

luminator

Rule Path
Disallow /

lwp

Rule Path
Disallow /

offbyone

Rule Path
Disallow /

pagesinventory

Rule Path
Disallow /

picsearch

Rule Path
Disallow /

powermarks

Rule Path
Disallow /

psbot

Rule Path
Disallow /

rackspace-block

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seo analyser bot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

softlayerblock

Rule Path
Disallow /

solomonobot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

wbsearchbot1

Rule Path
Disallow /

webcapture

Rule Path
Disallow /

wget

Rule Path
Disallow /

xpymep.exe

Rule Path
Disallow /

zend_http_client

Rule Path
Disallow /

zoomspider

Rule Path
Disallow /

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /NewBackup/
Disallow /SecondNewDirectory/
Disallow /_phpmyadmin/
Disallow /argh/
Disallow /etc/
Disallow /NewBackup/
Disallow /logs/
Disallow /phpmyadmin/
Disallow /phpmyadmin_27/
Disallow /phpmyadmin_2/
Disallow /phpmyadmin_damnthis/
Disallow /tmp/
Disallow /phpbb2_before_apr_2006/
Disallow /phpbb2_old_first/
Disallow /phpbb2_old_second/
Disallow /phpbb2_old_third/
Disallow /DentaKit/
Disallow /images/froogle/
Disallow /images/dentakit/
Disallow /phpbb2/viewtopic.php?t=23304
Disallow /phpbb2/viewtopic.php?t=23201Sitemap%3Ahttp%3A%2F%2Farchwired.com%2Fsitemap.xml

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/218352.xml.gz

Comments

  • robots.txt generated at www.mcanerin.com

Warnings

  • 4 invalid lines.