insuretechconnect.com
robots.txt

Robots Exclusion Standard data for insuretechconnect.com

Resource Scan

Scan Details

Site Domain insuretechconnect.com
Base Domain insuretechconnect.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-07-14T15:19:59+00:00
Next Scan 2024-10-12T15:19:59+00:00

Last Successful Scan

Scanned 2023-08-27T15:17:15+00:00
URL https://insuretechconnect.com/robots.txt
Domain IPs 104.21.18.92, 172.67.181.136, 2606:4700:3031::6815:125c, 2606:4700:3036::ac43:b588
Response IP 104.21.18.92
Found Yes
Hash f4fc6f4ce9f8626a23840fa048ccc5211465e1380a2b35069f2cfd6952761ee7
SimHash 4260434a33b8

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /
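
The group above pairs a list of user-agent tokens with a single rule, Disallow /, which blocks every path for those crawlers. A minimal sketch of how a crawler might decide which group applies to it is shown below. The names BLOCKED_AGENTS and select_group are hypothetical, and the exact lowercase comparison is a simplifying assumption; real crawlers may match product tokens more loosely.

```python
# User-agent tokens of the fully blocked group, copied from the scan above.
BLOCKED_AGENTS = {
    "adidxbot", "ahrefsbot", "aihitbot", "alphaseobot", "alphaseobot-sa",
    "baiduspider", "bingpreview", "blexbot", "careerbot", "cliqzbot",
    "dotbot", "grapeshot", "ichiro", "icjobs", "linkdexbot",
    "magpie-crawler", "megaindex", "mj12bot", "moget", "naverbot",
    "owlin", "owlin bot", "owlin bot v. 3.0", "proximic",
    "queryseekerspider", "scrapy", "scrapybot", "semrushbot", "sentibot",
    "seokicks-robot", "sogou", "sogou spider", "tkbot",
    "trendkite-akashic-crawler", "vagabondo", "wbsearchbot", "yandex",
    "yandexbot", "yeti", "youdaobot",
}


def select_group(product_token: str) -> str:
    """Return "blocked" for a listed crawler, otherwise the wildcard group "*".

    Assumption: tokens are compared case-insensitively and must match exactly.
    """
    token = product_token.strip().lower()
    return "blocked" if token in BLOCKED_AGENTS else "*"


def blocked_group_allows(path: str) -> bool:
    """The blocked group's only rule is "Disallow /", so any path starting with "/" is denied."""
    return not path.startswith("/")
```

For example, select_group("AhrefsBot") returns "blocked", while a crawler that is not listed, such as one identifying as "Googlebot", falls through to the "*" group.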

*

Rule Path
Allow /wp-includes/js/

*

Rule Path
Disallow /wp-admin/

*

Rule Path
Disallow /wp-includes/

*

Rule Path
Disallow /xmlrpc.php

*

Rule Path
Disallow /profile

*

Rule Path
Disallow /cgi-bin/

*

Rule Path
Disallow /wp-content/cache/

*

Rule Path
Disallow /trackback/

*

Rule Path
Disallow /comments/

*

Rule Path
Disallow /administrator/

*

Rule Path
Disallow */trackback/

*

Rule Path
Disallow */comments/

*

Rule Path
Disallow /license.txt

*

Rule Path
Disallow /*.php$

*

Rule Path
Disallow *?filter

*

Rule Path
Disallow /wp-content/themes/

*

Rule Path
Disallow /readme.html
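
The rules of the "*" group mix plain prefixes with the * wildcard and the $ end anchor, and one Allow rule overlaps a broader Disallow. The sketch below shows one way these rules could be evaluated, assuming longest-match precedence as described in RFC 9309, with Allow winning a tie. The helper names pattern_to_regex and is_allowed are hypothetical, and pattern length in characters is used as an approximation of match specificity.

```python
import re

# Rules of the "*" group, copied from the scan above.
RULES = [
    ("allow", "/wp-includes/js/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/profile"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-content/cache/"),
    ("disallow", "/trackback/"),
    ("disallow", "/comments/"),
    ("disallow", "/administrator/"),
    ("disallow", "*/trackback/"),
    ("disallow", "*/comments/"),
    ("disallow", "/license.txt"),
    ("disallow", "/*.php$"),
    ("disallow", "*?filter"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/readme.html"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start.

    "*" matches any run of characters; a trailing "$" anchors the end of the URL.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str) -> bool:
    """Apply the longest matching rule; Allow wins ties; no match means allowed."""
    best = None  # (pattern length, rule is an Allow)
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, is_allowed("/wp-includes/js/jquery.js") is True because the Allow rule is longer than Disallow /wp-includes/, while is_allowed("/index.php") is False because of the /*.php$ pattern.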

Comments

  • Horrible bandwidth eating robots