hudexchange.com
robots.txt

Robots Exclusion Standard data for hudexchange.com

Resource Scan

Scan Details

Site Domain hudexchange.com
Base Domain hudexchange.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-15T17:31:50+00:00
Next Scan 2025-01-13T17:31:50+00:00

Last Successful Scan

Scanned2024-03-20T17:22:21+00:00
URL https://hudexchange.com/robots.txt
Domain IPs 104.26.14.10, 104.26.15.10, 172.67.70.115, 2606:4700:20::681a:e0a, 2606:4700:20::681a:f0a, 2606:4700:20::ac43:4673
Response IP 172.67.70.115
Found Yes
Hash c321dbaa62b29a207749e5e6ddd166c3cc93082eec5f8ad3c4726d66c5b8c24d
SimHash 0a0ff5d1ea03

Groups

baiduspider-ads

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

adnormcrawler

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

bixolabs

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

butterfly

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

changedetection

Rule Path
Disallow /

compspybot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

domainappender

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

expo9

Rule Path
Disallow /

geliyoo spider

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

linguee

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ncbot

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

netestate

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

netseer

Rule Path
Disallow /

obot

Rule Path
Disallow /

obot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seoengworldbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

slurp

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

yyspider

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
Disallow /xmlrpc/
Disallow /counter.php

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap http://hudexchange.com/sitemap.xml

Warnings

  • 4 invalid lines.