technopolis.bg
robots.txt

Robots Exclusion Standard data for technopolis.bg

Resource Scan

Scan Details

Site Domain technopolis.bg
Base Domain technopolis.bg
Scan Status Ok
Last Scan 2024-10-01T09:01:34+00:00
Next Scan 2024-10-15T09:01:34+00:00

Last Scan

Scanned 2024-10-01T09:01:34+00:00
URL https://technopolis.bg/robots.txt
Redirect https://www.technopolis.bg/robots.txt
Redirect Domain www.technopolis.bg
Redirect Base technopolis.bg
Domain IPs 104.22.12.105, 104.22.13.105, 172.67.8.198, 2606:4700:10::6816:c69, 2606:4700:10::6816:d69, 2606:4700:10::ac43:8c6
Redirect IPs 104.22.12.105, 104.22.13.105, 172.67.8.198, 2606:4700:10::6816:c69, 2606:4700:10::6816:d69, 2606:4700:10::ac43:8c6
Response IP 104.22.12.105
Found Yes
Hash a9e66860322c03888ce3f3a0fd017a763b32793dc25a440193c31cbfa887b4bc
SimHash e361bb75ce4f

Groups

*
rogerbot
mj12bot
exabot
dotbot
gigabot
blackwidow
cazoodlebot
chinaclaw
custo
disco
download\ demon
dotbot/1.0
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
mj12bot
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus
scrapy
petalbot

Rule Path
Disallow /
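The group above can be read as "every listed agent, plus the `*` wildcard, is denied the whole site". A minimal sketch of how a client might check this with Python's standard `urllib.robotparser`, assuming a shortened excerpt of the group (only four of the listed agents are reproduced, and the `is_blocked` helper is a hypothetical name, not part of the scanned file):

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: only a few of the listed agents are reproduced here.
ROBOTS_EXCERPT = """\
User-agent: *
User-agent: rogerbot
User-agent: mj12bot
User-agent: scrapy
Disallow: /
"""


def is_blocked(user_agent: str, url: str) -> bool:
    """Return True when the excerpt forbids user_agent from fetching url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network access is needed.
    parser.parse(ROBOTS_EXCERPT.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("mj12bot", "scrapy", "some-unlisted-bot"):
        print(agent, is_blocked(agent, "https://www.technopolis.bg/"))
```

Because `*` sits in the same group, an agent that is not named anywhere in the file would also fall under `Disallow /` in this sketch.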

googlebot
bingbot
yandexbot
applebot
twitterbot
linkedinbot
pinterestbot
facebook external hit, facebook crawler
gptbot
slurp
google-inspectiontool
claudebot
baiduspider
adsbot-google
adsbot-google-mobile
offeristacrawler/1.0
ahrefsbot
semrushbot
screaming frog seo spider

Rule Path
Allow /
Disallow /bg/cart
Disallow /bg/checkout
Disallow /bg/my-account
Disallow /bg/return-product
Disallow /en/return-product
Disallow /en/cart
Disallow /en/checkout
Disallow /en/my-account
Disallow *pricerange%3D*
Disallow *pageselect%3D*
Disallow *relevance%3A*
Disallow /medias/*Variant-colour*
Disallow /medias/*Product-bundle*
Disallow /medias/*FreeGift*
Disallow /medias/*Mini-cart*
Disallow /medias/*Basket*
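The rules above mix a blanket `Allow /` with more specific `Disallow` patterns, some of which use the `*` wildcard. One common interpretation (the longest-match precedence described in RFC 9309) is sketched below. This is an illustration under stated assumptions, not the scanner's or any crawler's actual implementation: the helper names are hypothetical, the rule list is a subset of the table, the sample paths are invented, and paths are compared as raw strings without percent-encoding normalization.

```python
import re

# Subset of the rules listed above, as (kind, pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/bg/cart"),
    ("disallow", "*pricerange%3D*"),
    ("disallow", "/medias/*Basket*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")  # a trailing '$' pins the end of the path
    body = pattern[:-1] if anchored else pattern
    # '*' matches any run of characters; everything else is literal.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Apply longest-match precedence; on a tie, prefer 'allow'."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/bg/televizori", "/bg/cart", "/medias/tv-Basket-96x96.png"):
        print(sample, is_allowed(sample, RULES))
```

A hand-written matcher is used here because Python's `urllib.robotparser` applies the first matching rule in file order and treats patterns as plain prefixes, so with `Allow /` listed first it would report every path as allowed.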

googlebot-image

Rule Path
Allow /
Disallow /bg/cart
Disallow /bg/checkout
Disallow /bg/my-account
Disallow /bg/return-product
Disallow /en/return-product
Disallow /en/cart
Disallow /en/checkout
Disallow /en/my-account
Disallow /medias/*Variant-colour*
Disallow /medias/*Product-bundle*
Disallow /medias/*FreeGift*
Disallow /medias/*Mini-cart*
Disallow /medias/*Basket*

googlebot-mobile

Rule Path
Allow /
Disallow /bg/cart
Disallow /bg/checkout
Disallow /bg/my-account
Disallow /bg/return-product
Disallow /en/return-product
Disallow /en/cart
Disallow /en/checkout
Disallow /en/my-account
Disallow /medias/*Variant-colour*
Disallow /medias/*Product-bundle*
Disallow /medias/*FreeGift*
Disallow /medias/*Mini-cart*
Disallow /medias/*Basket*

bingbot

Rule Path
Allow /
Disallow /bg/cart
Disallow /bg/checkout
Disallow /bg/my-account
Disallow /bg/return-product
Disallow /en/return-product
Disallow /en/cart
Disallow /en/checkout
Disallow /en/my-account
Disallow /medias/*Variant-colour*
Disallow /medias/*Product-bundle*
Disallow /medias/*FreeGift*
Disallow /medias/*Mini-cart*
Disallow /medias/*Basket*

claudebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
sitemap https://www.technopolis.bg/sitemap.xml
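The `crawl-delay` and `sitemap` records can be read programmatically as well. A short sketch with `urllib.robotparser`, assuming the delay belongs to the `*` group (the report does not say which group carries it, so that placement is a guess) and using a hypothetical agent name:

```python
from urllib.robotparser import RobotFileParser

# Assumed layout: the report lists crawl-delay without naming its group,
# so attaching it to '*' here is an illustration only.
ROBOTS_EXCERPT = """\
User-agent: *
Crawl-delay: 10
Disallow: /

Sitemap: https://www.technopolis.bg/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# crawl_delay() returns the delay in seconds for the matching group, or None.
delay = parser.crawl_delay("examplebot")  # "examplebot" is a made-up agent
# site_maps() (Python 3.8+) returns the listed sitemap URLs, or None.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(delay, sitemaps)
```

A polite client would wait at least `delay` seconds between requests and use the sitemap URL as its starting point for discovery.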

Comments

  • For all robots
  • Allow access to specific groups of bots
  • Allow access to specific groups of crawler tools - May be
  • Allow search crawlers to discover the sitemap