pullmanradio.com
robots.txt

Robots Exclusion Standard data for pullmanradio.com

Resource Scan

Scan Details

Site Domain pullmanradio.com
Base Domain pullmanradio.com
Scan Status Ok
Last Scan 2024-10-22T10:52:18+00:00
Next Scan 2024-11-21T10:52:18+00:00

Last Scan

Scanned 2024-10-22T10:52:18+00:00
URL https://pullmanradio.com/robots.txt
Domain IPs 35.215.86.190
Response IP 35.215.86.190
Found Yes
Hash 8366844347bd18627d5bcd0d4515d3346ec3fe7115bbb3ab9bcbdd09769d9004
SimHash 529e7333e3d3
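
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. The following is a minimal sketch of how such a content fingerprint could be computed to detect changes between scans, assuming SHA-256 over the raw response bytes. The function name `content_fingerprint` is hypothetical, and the SimHash value is not reproduced here because its scheme is not described.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    previous = content_fingerprint(b"User-agent: *\nDisallow: /calendar/\n")
    current = content_fingerprint(b"User-agent: *\nDisallow: /calendar-2/\n")
    # Differing digests indicate that the file changed between two scans.
    print("changed" if previous != current else "unchanged")
```

Comparing the stored digest with a freshly computed one is enough to decide whether the rules need to be re-parsed.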

Groups

*

Rule Path
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
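
As a rough illustration of what the `*` group means for a crawler, the sketch below feeds these six rules to Python's standard `urllib.robotparser` and asks whether two URLs may be fetched. The `User-agent:` / `Disallow:` line syntax is reconstructed from the table above rather than copied from the live file, and the agent name `examplebot` is a placeholder.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt: only the "*" group from the report.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /calendar/action~posterboard/
Disallow: /calendar/action~agenda/
Disallow: /calendar/action~oneday/
Disallow: /calendar/action~month/
Disallow: /calendar/action~week/
Disallow: /calendar/action~stream/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# "examplebot" is a hypothetical agent that falls back to the "*" group.
blocked = parser.can_fetch(
    "examplebot", "https://pullmanradio.com/calendar/action~month/"
)
allowed = parser.can_fetch("examplebot", "https://pullmanradio.com/calendar/")

print("month view fetchable:", blocked)
print("calendar root fetchable:", allowed)
```

These rules are plain path prefixes, so prefix matching is sufficient here. The wildcard rules in the final group of this report need a matcher that understands `*`.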

a6-indexer

Rule Path
Disallow /

aboundex
asterias
backdoorbot/1.0
backlinkcrawler
black hole
blowfish/1.0
botalot
builtbottough
bullseye/1.0
bunnyslippers
ca-crawler
ccbot
ccbot/2.0
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dbot
dittospyder
easouspider
eccp
emailcollector
emailsiphon
emailwolf
erocrawler
exabot/3.0
extractorpro
foobot
fr-crawler
friendly crawler
gigablastopensource
goodzer/2.0
grapeshotcrawler/2.0
harvest/1.5
heritrix/1.14.4
hloader
httplib
hubspot crawler 1.0
humanlinks
istellabot
infonavirobot
jennybot
kenjin spider
keyword density/0.9
konqueror/3.5
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lipperhey seo service
lnspiderguy
lwp-trivial
lwp-trivial/1.34
mata hari
meanpathbot
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mixrankbot
moget
moget/2.1
mozilla/4
mozilla/4.0 (compatible; bullseye; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 98)
mozilla/4.0 (compatible; msie 4.0; windows nt)
mozilla/4.0 (compatible; msie 4.0; windows xp)
mozilla/4.0 (compatible; msie 4.0; windows 2000)
mozilla/4.0 (compatible; msie 4.0; windows me)
mozilla/5
nbot/2.0
netants
netzcheckbot/1.0
nicerspro
obot/2.3.1
offline explorer
openfind
openfind data gathere
pagesinventory
panscient.com

Rule Path
Disallow /https%3A//www.siteground.com/kb/google_marked_my_website_as_harmful/

propowerbot/2.14
prowebwalker
queryn metasearch
repomonkey
repomonkey bait & tackle/v1.01
riddler
rma
ru_bot
screenerbot crawler beta 2.0
semrushbot/0.98~bl
seoengworldbot
seokicks-robot
seplinkbot
seplinkbot/1.0
seznambot
sistrix
sitesnagger
smtbot/1.0
spankbot
spbot
sogou web spider
spanner
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
urlappendbot
urly warning
vci
vci webviewer vci webviewer win32
voltron
wbsearchbot
web image collector
webauto
webbandit
webbandit/3.50
webcapture 2.0
webcheck 1.10.4
webcopierdanilo
c14542.sgvps.net webenhancer
webmasterworldforumbot
websauger
website quester
webster pro
webstripper
webzip
webzip/4.0
wesee
wget
wget/1.5.3
wget/1.6
www-collector-e
www.integromedb.org/crawler
xenu link sleuth
xenu's link sleuth 1.1c
zeus
zeus 32297 webster pro v2.9 win32
xovibot

Rule Path
Disallow /Danilo

zumbot

Rule Path
Disallow /cgi-bin/

ahrefsbot

No rules defined. All paths allowed.

baiduspider
baiduspider
baiduspider+

No rules defined. All paths allowed.

sch-fast-se-crawl02.osl.basefarm.net

Rule Path
Disallow /

sch-fast-se-crawl04.osl.basefarm.net
ichiro
naverbot
yeti
baiduspider-video
baiduspider-image
sogou spider
youdaobot
mj12bot
googlebot-image
googlebot
mediapartners-google/2.1
mediapartners-google*
msnbot
msnbot-newsblogs
slurp
yahoo-mmcrawler
yahoo-blogs/v3.9
gigabot
ia_archiver
botrighthere
larbin
b2w/0.1
copernic
psbot
python-urllib
netmechanic
url_spider_pro
alexibot
webcopier
openfind data gatherer
xenu's
openbot
url control
zeus link scout
iron33/1.0.2
bookmark search tool
getright/4.2
fairad client
gaisbot
aqua_products
radiation retriever 1.1
flaming attackbot
curl
web reaper
firefox
opera
netscape
webvulncrawl
webvulnscan

Rule Path
Disallow /calendar-2/action~posterboard/
Disallow /calendar-2/action~agenda/
Disallow /calendar-2/action~oneday/
Disallow /calendar-2/action~month/
Disallow /calendar-2/action~week/
Disallow /calendar-2/action~stream/
Disallow /calendar-2/action~undefined/
Disallow /calendar-2/action~http%3A/
Disallow /calendar-2/action~default/
Disallow /calendar-2/action~poster/
Disallow /calendar-2/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/
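
Several rules in this group use `*` as a wildcard rather than a literal character. The sketch below shows one common way to evaluate such patterns: each `*` matches any run of characters, and a trailing `$` anchors the end of the path. The function name `rule_matches` is hypothetical, and percent-encoding normalization is deliberately left out, so this is an approximation and not the scanner's own logic.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Assumptions: "*" matches any sequence of characters, a trailing "$"
    anchors the end of the path, and matching starts at the beginning of
    the path. No percent-encoding normalization is performed.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


if __name__ == "__main__":
    print(rule_matches("/*/action~*/", "/calendar-2/action~month/"))
    print(rule_matches("/calendar-2/action~*/", "/calendar-2/"))
```

Under these assumptions, the catch-all rule `/*/action~*/` covers every `action~` view under any first-level directory, which makes the more specific `/calendar-2/action~...` entries redundant but harmless.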

Comments

  • Begin Exclusion From Directories from robots.txt
  • disallow OSL.basefarm.net

Warnings

  • 3 invalid lines.
  • `user-agenc14542.sgvps.nett` is not a known field.
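
Warnings of this kind can be produced by a simple line-level check. The sketch below is one possible approach, not the scanner's actual implementation: it treats a line without a `:` separator as invalid and flags any field name outside a small known set. The `KNOWN_FIELDS` set, the function name `lint_robots`, and the sample input are all assumptions made for illustration. The raw robots.txt lines are not shown in this report.

```python
# Hypothetical set of accepted field names; real scanners may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def lint_robots(text: str) -> list[tuple[int, str]]:
    """Return (line number, message) pairs for suspicious lines."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        field, separator, _value = line.partition(":")
        name = field.strip().lower()
        if not separator:
            warnings.append((number, "invalid line"))
        elif name not in KNOWN_FIELDS:
            warnings.append((number, f"`{name}` is not a known field"))
    return warnings


if __name__ == "__main__":
    # Illustrative input only; the second line imitates the garbled
    # field name quoted in the warning above.
    sample = (
        "User-agent: *\n"
        "user-agenc14542.sgvps.nett: webenhancer\n"
        "this line has no separator\n"
        "Disallow: /\n"
    )
    for line_number, message in lint_robots(sample):
        print(line_number, message)
```

A garbled field name like the one reported here usually points to text pasted into the middle of a `User-agent` line, so the fix belongs in the robots.txt file itself.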