wholesalepoint.com
robots.txt

Robots Exclusion Standard data for wholesalepoint.com

Resource Scan

Scan Details

Site Domain wholesalepoint.com
Base Domain wholesalepoint.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-10T07:58:20+00:00
Next Scan 2024-11-08T07:58:20+00:00

Last Successful Scan

Scanned2024-03-21T06:57:36+00:00
URL https://wholesalepoint.com/robots.txt
Domain IPs 104.26.6.251, 104.26.7.251, 172.67.70.97, 2606:4700:20::681a:6fb, 2606:4700:20::681a:7fb, 2606:4700:20::ac43:4661
Response IP 172.67.70.97
Found Yes
Hash 6eff7a466707a981931d10f4476ba6410433b6b92b677b2aa0e83cbca5a701a0
SimHash 915f5a02ccf1

Groups

aspiegelbot

Rule Path
Disallow /

*
googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

ahrefsbot
mj12bot
dotbot
semrushbot
rogerbot
screaming frog seo spider
scoutjet
linkdex
moatbot
adbeat_bot
xenu

Rule Path
Disallow /

feedfetcher

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

ubicrawler
doc
zao

Rule Path
Disallow /

sitecheck.internetseer.com
zealbot
msiecrawler
sitesnagger
webstripper
webcopier
fetch
offline explorer
teleport
teleportpro
webzip
linko
httrack
microsoft.url.control
larbin
libwww
zyborg
download ninja
wget
grub-client
npbot
webreaper
k2spider
charlotte
litefinder
fatbot
yahooseeker

Rule Path
Disallow /

aspiegelbot
compspybot
curious george
cybeye.com
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
gib
lwnutch
lexxebot
nutch
gigabot
openwebindex
rediffnewsbot
seoengworldbot
scanmine
shopwiki
showyoubot
sosospider
wocbot
yeti
youdaobot
daumoa
gsa-crawler
libcrawl
magpie-crawler
repparser
sindice-site-manager
woriobot
yacybot
yolinkbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.wholesalepoint.com/sitemap.xml

Comments

  • Wholesalepoint.com Robots.txt
  • Google
  • SEO/SEM Competitor Tool Bot Block
  • Exploitable Google bot
  • Advertising related bots
  • Bots that obey Robots.txt block
  • Site Copiers that do not always obey robots.txt (might need to be redirected)
  • Others to be blocked

Warnings

  • `noindex` is not a known field.