propertynews.pl
robots.txt

Robots Exclusion Standard data for propertynews.pl

Resource Scan

Scan Details

Site Domain propertynews.pl
Base Domain propertynews.pl
Scan Status Ok
Last Scan2024-09-26T04:27:54+00:00
Next Scan 2024-10-03T04:27:54+00:00

Last Scan

Scanned2024-09-26T04:27:54+00:00
URL https://propertynews.pl/robots.txt
Redirect https://www.propertynews.pl/robots.txt
Redirect Domain www.propertynews.pl
Redirect Base propertynews.pl
Domain IPs 51.77.44.234
Redirect IPs 51.77.44.234
Response IP 51.77.44.234
Found Yes
Hash 8208c4f4ceb3b43228d76de678b0d2fb01da20ef213b626985d1952c73fed7a9
SimHash 739f53e1c7a2

Groups

*

Rule Path
Disallow /oferty/praca/
Disallow /oferty/komunikaty/
Disallow /oferty/szkolenia/
Disallow /szukaj.html
Disallow /m/
Disallow /mobile/
Disallow /forumax.html
Disallow /linkedin.html
Disallow /hotels-map/szukaj?*
Disallow /retail-map/szukaj?*
Disallow /warehouses-map/szukaj.html?*
Disallow /en/search.html?qt=*
Disallow /rss/serwis_rss_*.xml

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

Rule Path
Disallow /

aboundex
accelobot
add\ catalog
ahrefsbot
aihitbot
alexibot
aqua_products
askjeeves
asterias
awcheckbot
b2w/0.1
backdoorbot/1.0
backlinkcrawler
baiduspider
becomebot
blexbot
blowfish/1.0
bookmark search tool
botalot
brandwatch.net
builtbottough
bullseye/1.0
bunnyslippers
butterfly
catchbot
charlotte
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
clipish
cliqzbot
comodo
comodo-certificates-spider
compspybot
copernic
copyrightcheck
cosmos
crawler
crescent
crescent internet toolpak http ole control v.1.0
curious
curl
dataprovider\.com
dinoping
discoverybot
dittospyder
domaincrawler
domaincrawler
dotbot
dotnetdotcom
dow\ jones\ searchbot
dumbot
easouspider
emailcollector
emailsiphon
emailwolf
enterprise_search
enterprise_search/1.0
erocrawler
es
exabot
extractorpro
ezinearticleslinkscanner
ezooms
fairad client
flaming attackbot
foobot
freefind
ftrf\:\ friendly
gaisbot
getright/4.2
gigabot
grub
grub-client
harvest/1.5
hatena antenna
hloader
http://www.searchengineworld.com bot
http://www.webmasterworld.com bot
http_request
http_request2
httplib
humanlinks
ia_archiver
ia_archiver
ia_archiver/1.6
indy\ library
infonavirobot
ip\-web\-crawler\.com
iron33/1.0.2
jakarta\ commons-httpclient
jeeves
jennybot
jetbot
jetbot/1.0
jikespider
kenjin spider
keyword density/0.9
larbin
lexibot
libweb/clshttp
libwww-perl
lindex\.com
linguee
linkdex\.com
linkdexbot
linkextractorpro
linkscan/8.1a unix
linkwalker
lipperhey
lnspiderguy
looksmart
ltbot
lwp-trivial
lwp-trivial/1.34
magpie\-crawler
mata hari
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
moget
moget/2.1
msie\ or\ firefox\ mutant
msiecrawler
naver
ncbot
netants
netcraftsurveyagent
netestate\ ne\ crawler
netmechanic
netseer
nextgensearchbot
nicerspro
nutch
nutch
ocelli
offline explorer
omniexplorer_bot
openbot
openfind
openfind
openfind data gathere
openwebindex
oracle ultra search
pagesinventory
pear
peoplepal
perman
procogseobot
propowerbot/2.14
prowebwalker
proximic
psbot
purebot
queryn metasearch
queryseekerspider
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
riddler
rma
rojerbot
rytebot
scooter
scoutjet
scrapy
screenerbot
searchmetrics
searchpreview
semrushbot
sentibot
seo-crawling
seoengworldbot
seokicks-robot
shopwiki
sistrix
sitebot
sitesnagger
snoopy
socialsearcher
sogou
sogou
solomonobot
sootle
sosospider
spankbot
spanner
spbot
speedy
stanford
stanford comp sci
surveybot
suzuran
szukacz/1.4
szukacz/1.4
teleport
teleportpro
telesoft
teoma
the intraformant
the\ incutio\ xml-rpc\ php\ library
thenomad
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
ucrawler
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
visaduhoc\.info
wbsearchbot
web image collector
webauto
webbandit
webbandit/3.50
webcapture
webcopier
webenhancer
webindetail\.com
webmasterworld extractor
webmasterworldforumbot
websauger
website quester
websitetheweb\.com
webster pro
webstripper
webvac
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
wotbot
www\.integromedb\.org
www-collector-e
xpymep\.exe
yamanalab-robot
yisouspider
yodaobot
youdaobot
zend_http_client
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
zmeu
zumbot

Rule Path
Disallow /

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
meta-externalagent
meta-externalfetcher
moodlebot
newsnow
oai-searchbot
omgili
omgilibot
perplexitybot
petalbot
piplbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.propertynews.pl/sitemapindex.xml
sitemap https://www.propertynews.pl/sitemap-news.xml

Comments

  • SearchEngines
  • 3rdParties
  • AI/LLMs
  • Sitemaps

Warnings

  • 4 invalid lines.