salthub.com
robots.txt

Robots Exclusion Standard data for salthub.com

Resource Scan

Scan Details

Site Domain salthub.com
Base Domain salthub.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-23T15:09:14+00:00
Next Scan 2024-06-06T15:09:14+00:00

Last Successful Scan

Scanned 2024-05-01T13:04:03+00:00
URL https://salthub.com/robots.txt
Domain IPs 104.21.46.56, 172.67.223.228, 2606:4700:3030::ac43:dfe4, 2606:4700:3037::6815:2e38
Response IP 104.21.46.56
Found Yes
Hash 8f1d612ba96b4ddbdafad120317950eae23e5c527295fe1083be2b0fa68ba7a4
SimHash d225b97a8e52

Groups

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow *?attachment_id=
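
The group above mixes Allow and Disallow rules with `*` wildcards, so which rule applies to a given path depends on how a crawler resolves conflicts. The sketch below assumes the matching behaviour described in RFC 9309 (`*` matches any character sequence, a trailing `$` anchors the end, and the longest matching pattern wins, with Allow winning ties). The names `pattern_to_regex`, `is_allowed`, and `STAR_RULES` are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules copied from the first "*" group in this report.
STAR_RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*/*.css"),
    ("allow", "/*/*.js"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "*?attachment_id="),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Apply longest-match precedence; paths with no matching rule are allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/wp-admin/admin-ajax.php", STAR_RULES))    # True
print(is_allowed("/wp-admin/options.php", STAR_RULES))       # False
print(is_allowed("/wp-includes/css/style.css", STAR_RULES))  # False
```

Under this assumed precedence, the short `Allow /*/*.css` pattern does not override the longer `Disallow /wp-includes/`, so stylesheets under `/wp-includes/` would remain blocked for a crawler that follows the same rule.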

*

Rule Path
Disallow /wp-json/
Disallow /?rest_route=

*

Rule Path
Disallow *?s=*
Disallow *?p=*
Disallow *%26p%3D*
Disallow *%26preview%3D*

*

Rule Path
Disallow /feed/
Disallow /feed/$
Disallow /comments/feed
Disallow */feed
Disallow */feed$
Disallow /?feed=
Disallow /wp-feed

*

Rule Path
Disallow /trackback/
Disallow */comments$
Disallow */trackback
Disallow */trackback$
Disallow /wp-comments
Disallow /wp-trackback
Disallow */replytocom%3D

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /cart/

*

Rule Path
Disallow /checkout/

*

Rule Path
Disallow /login/

*

Rule Path
Disallow /*?orderby=price
Disallow /*?orderby=rating
Disallow /*?orderby=date
Disallow /*?orderby=price-desc
Disallow /*?orderby=popularity
Disallow /*?filter
Disallow /*?orderby=title
Disallow /*?orderby=desc
Disallow /*add-to-cart%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*?paged=&count=*
Disallow /*?count=*
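
The `*` user agent appears in several separate groups above. RFC 9309 says that groups naming the same user agent are combined into one, so a crawler would see a single merged rule list. A minimal sketch of that merge, assuming groups are already parsed into (agents, rules) pairs; the function name `merge_by_agent` and the sample data layout are hypothetical.

```python
from collections import defaultdict


def merge_by_agent(groups):
    """Combine the rules of all groups that name the same user agent."""
    merged = defaultdict(list)
    for agents, rules in groups:
        for agent in agents:
            # User-agent tokens are compared case-insensitively.
            merged[agent.lower()].extend(rules)
    return dict(merged)


# Three of the groups listed in this report, in file order.
sample_groups = [
    (["*"], [("disallow", "/cart/")]),
    (["*"], [("disallow", "/checkout/")]),
    (["ccbot"], [("disallow", "/")]),
]

merged = merge_by_agent(sample_groups)
print(merged["*"])  # [('disallow', '/cart/'), ('disallow', '/checkout/')]
```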

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

xenu

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

giftghostbot

Rule Path
Disallow /

seznam

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

dataprovider/6.101

Rule Path
Disallow /

dataprovidersiteexplorer

Rule Path
Disallow /

dazoobot/1.0

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

domainstatsbot/1.0

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

dubaiindex

Rule Path
Disallow /

ecommercebot

Rule Path
Disallow /

expertsearchspider

Rule Path
Disallow /

feedbin

Rule Path
Disallow /

fetch/2.0a

Rule Path
Disallow /

ffbot/1.0

Rule Path
Disallow /

focusbot/1.1

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

huaweisymantecspider/1.0

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

lemurwebcrawler

Rule Path
Disallow /

lipperheylinkexplorer

Rule Path
Disallow /

lssrocketcrawler/1.0

Rule Path
Disallow /

lyt.srv1.5

Rule Path
Disallow /

miadev/0.0.1

Rule Path
Disallow /

najdi.si/3.1

Rule Path
Disallow /

bountiibot

Rule Path
Disallow /

experibot_v1

Rule Path
Disallow /

bixocrawler

Rule Path
Disallow /

bixocrawler testcrawler

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

crowsnest/0.5

Rule Path
Disallow /

cukbot

Rule Path
Disallow /

dataprovider/6.92

Rule Path
Disallow /

dblbot/1.0

Rule Path
Disallow /

diffbot/0.1

Rule Path
Disallow /

digg deeper/v1

Rule Path
Disallow /

discobot/1.0

Rule Path
Disallow /

discobot/1.1

Rule Path
Disallow /

discobot/2.0

Rule Path
Disallow /

discoverybot/2.0

Rule Path
Disallow /

dlvr.it/1.0

Rule Path
Disallow /

domainstatsbot/1.0

Rule Path
Disallow /

drupact/0.7

Rule Path
Disallow /

ezooms/1.0

Rule Path
Disallow /

fastbot crawler beta 2.0

Rule Path
Disallow /

fastbot crawler beta 4.0

Rule Path
Disallow /

feedly social

Rule Path
Disallow /

feedly/1.0

Rule Path
Disallow /

feedlybot/1.0

Rule Path
Disallow /

feedspot

Rule Path
Disallow /

feedspotbot/1.0

Rule Path
Disallow /

clickagy intelligence bot v2

Rule Path
Disallow /

classbot

Rule Path
Disallow /

cispa vulnerability notification

Rule Path
Disallow /

cirrusexplorer/1.1

Rule Path
Disallow /

checksem/nutch-1.10

Rule Path
Disallow /

catchbot/5.0

Rule Path
Disallow /

catchbot/3.0

Rule Path
Disallow /

catchbot/2.0

Rule Path
Disallow /

catchbot/1.0

Rule Path
Disallow /

camontspider/1.0

Rule Path
Disallow /

buzzbot/1.0

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

businessseek.biz_spider

Rule Path
Disallow /

bubing

Rule Path
Disallow /

fyberspider/1.3

Rule Path
Disallow /

findlinks/1.1.6-beta5

Rule Path
Disallow /

g2reader-bot/1.0

Rule Path
Disallow /

findlinks/1.1.6-beta6

Rule Path
Disallow /

findlinks/2.0

Rule Path
Disallow /

findlinks/2.0.1

Rule Path
Disallow /

findlinks/2.0.2

Rule Path
Disallow /

findlinks/2.0.4

Rule Path
Disallow /

findlinks/2.0.5

Rule Path
Disallow /

findlinks/2.0.9

Rule Path
Disallow /

findlinks/2.1

Rule Path
Disallow /

findlinks/2.1.5

Rule Path
Disallow /

findlinks/2.1.3

Rule Path
Disallow /

findlinks/2.2

Rule Path
Disallow /

findlinks/2.5

Rule Path
Disallow /

findlinks/2.6

Rule Path
Disallow /

ffbot/1.0

Rule Path
Disallow /

findlinks/1.0

Rule Path
Disallow /

findlinks/1.1.3-beta8

Rule Path
Disallow /

findlinks/1.1.3-beta9

Rule Path
Disallow /

findlinks/1.1.4-beta7

Rule Path
Disallow /

findlinks/1.1.6-beta1

Rule Path
Disallow /

findlinks/1.1.6-beta1 yacy

Rule Path
Disallow /

findlinks/1.1.6-beta2

Rule Path
Disallow /

findlinks/1.1.6-beta3

Rule Path
Disallow /

findlinks/1.1.6-beta4

Rule Path
Disallow /

bixo

Rule Path
Disallow /

bixolabs/1.0

Rule Path
Disallow /

crawlera/1.10.2

Rule Path
Disallow /

dataprovider site explorer

Rule Path
Disallow /

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule Path
Disallow /
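
The group above lists many user agents that share a single `Disallow /` rule. The sketch below shows one way a parser might form such groups: consecutive `User-agent` lines accumulate into one group until a rule line is seen, and the next `User-agent` line after a rule starts a new group. The function name `parse_groups` and the sample input are hypothetical; this is not the scanner's actual parser, and it ignores fields other than Allow and Disallow.

```python
def parse_groups(text: str) -> list[tuple[list[str], list[tuple[str, str]]]]:
    """Split robots.txt text into (user agents, rules) groups."""
    groups = []
    agents: list[str] = []
    rules: list[tuple[str, str]] = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or unparseable line
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if rules:
                # A rule was already seen, so this line opens a new group.
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif field in ("allow", "disallow") and agents:
            rules.append((field, value))
    if agents:
        groups.append((agents, rules))
    return groups


sample = """\
User-agent: rogerbot
User-agent: exabot
Disallow: /

User-agent: *
Disallow: /cart/
"""

for group_agents, group_rules in parse_groups(sample):
    print(group_agents, group_rules)
```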

rogerbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

chinaclaw

Rule Path
Disallow /

custo

Rule Path
Disallow /

disco

Rule Path
Disallow /

download\ demon

Rule Path
Disallow /

ecatch

Rule Path
Disallow /

eirgrabber

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

express\ webpictures

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

eyenetie

Rule Path
Disallow /

flashget

Rule Path
Disallow /

getright

Rule Path
Disallow /

getweb!

Rule Path
Disallow /

go!zilla

Rule Path
Disallow /

go-ahead-got-it

Rule Path
Disallow /

grabnet

Rule Path
Disallow /

grafula

Rule Path
Disallow /

hmview

Rule Path
Disallow /

httrack

Rule Path
Disallow /

image\ stripper

Rule Path
Disallow /

image\ sucker

Rule Path
Disallow /

indy\ library

Rule Path
Disallow /

interget

Rule Path
Disallow /

internet\ ninja

Rule Path
Disallow /

jetcar

Rule Path
Disallow /

joc\ web\ spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leechftp

Rule Path
Disallow /

mass\ downloader

Rule Path
Disallow /

midown\ tool

Rule Path
Disallow /

mister\ pix

Rule Path
Disallow /

navroad

Rule Path
Disallow /

nearsite

Rule Path
Disallow /

netants

Rule Path
Disallow /

netspider

Rule Path
Disallow /

net\ vampire

Rule Path
Disallow /

netzip

Rule Path
Disallow /

octopus

Rule Path
Disallow /

offline\ explorer

Rule Path
Disallow /

offline\ navigator

Rule Path
Disallow /

pagegrabber

Rule Path
Disallow /

papa\ foto

Rule Path
Disallow /

pavuk

Rule Path
Disallow /

pcbrowser

Rule Path
Disallow /

realdownload

Rule Path
Disallow /

reget

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

smartdownload

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superhttp

Rule Path
Disallow /

surfbot

Rule Path
Disallow /

takeout

Rule Path
Disallow /

teleport\ pro

Rule Path
Disallow /

voideye

Rule Path
Disallow /

web\ image\ collector

Rule Path
Disallow /

web\ sucker

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webfetch

Rule Path
Disallow /

webgo\ is

Rule Path
Disallow /

webleacher

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website\ extractor

Rule Path
Disallow /

website\ quester

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webwhacker

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

widow

Rule Path
Disallow /

wwwoffle

Rule Path
Disallow /

xaldon\ webspider

Rule Path
Disallow /

zeus

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

chinaclaw

Rule Path
Disallow /

custo

Rule Path
Disallow /

disco

Rule Path
Disallow /

download\ demon

Rule Path
Disallow /

ecatch

Rule Path
Disallow /

eirgrabber

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

express\ webpictures

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

eyenetie

Rule Path
Disallow /

flashget

Rule Path
Disallow /

getright

Rule Path
Disallow /

getweb!

Rule Path
Disallow /

go!zilla

Rule Path
Disallow /

go-ahead-got-it

Rule Path
Disallow /

grabnet

Rule Path
Disallow /

grafula

Rule Path
Disallow /

hmview

Rule Path
Disallow /

httrack

Rule Path
Disallow /

image\ stripper

Rule Path
Disallow /

image\ sucker

Rule Path
Disallow /

indy\ library

Rule Path
Disallow /

interget

Rule Path
Disallow /

internet\ ninja

Rule Path
Disallow /

jetcar

Rule Path
Disallow /

joc\ web\ spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leechftp

Rule Path
Disallow /

mass\ downloader

Rule Path
Disallow /

midown\ tool

Rule Path
Disallow /

mister\ pix

Rule Path
Disallow /

navroad

Rule Path
Disallow /

nearsite

Rule Path
Disallow /

netants

Rule Path
Disallow /

netspider

Rule Path
Disallow /

net\ vampire

Rule Path
Disallow /

netzip

Rule Path
Disallow /

octopus

Rule Path
Disallow /

offline\ explorer

Rule Path
Disallow /

offline\ navigator

Rule Path
Disallow /

pagegrabber

Rule Path
Disallow /

papa\ foto

Rule Path
Disallow /

pavuk

Rule Path
Disallow /

pcbrowser

Rule Path
Disallow /

realdownload

Rule Path
Disallow /

reget

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

smartdownload

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superhttp

Rule Path
Disallow /

surfbot

Rule Path
Disallow /

takeout

Rule Path
Disallow /

teleport\ pro

Rule Path
Disallow /

voideye

Rule Path
Disallow /

web\ image\ collector

Rule Path
Disallow /

web\ sucker

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webfetch

Rule Path
Disallow /

webgo\ is

Rule Path
Disallow /

webleacher

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website\ extractor

Rule Path
Disallow /

website\ quester

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webwhacker

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

widow

Rule Path
Disallow /

wwwoffle

Rule Path
Disallow /

xaldon\ webspider

Rule Path
Disallow /

zeus

Rule Path
Disallow /

Comments

  • Advanced Wordpress
  • Prevent Crawling of WordPress JSON API Endpoints
  • Block Parameters
  • Block Feed
  • Block Spam Directories
  • Block archive.org bots
  • Block Chatgpt
  • Block Cart Page
  • Block Checkout Page
  • Block Login Page
  • Block Woocommerce Parameters
  • Block Ahrefs Crawler
  • Block Semrush Crawler
  • Block Moz Crawler
  • Block Majestic Crawler
  • Block Xenu Crawler
  • Block Scrapper Bots

Warnings

  • 8 invalid lines.