bookmarketing.pro
robots.txt

Robots Exclusion Standard data for bookmarketing.pro

Resource Scan

Scan Details

Site Domain bookmarketing.pro
Base Domain bookmarketing.pro
Scan Status Ok
Last Scan2024-10-25T08:48:09+00:00
Next Scan 2024-11-24T08:48:09+00:00

Last Scan

Scanned2024-10-25T08:48:09+00:00
URL https://bookmarketing.pro/robots.txt
Domain IPs 52.4.208.217
Response IP 52.4.208.217
Found Yes
Hash 6e6a0738b68bba517ca87705505575ca23b749fcfa73d9913a75f314a707df64
SimHash f21e955143e1

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

msnbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /

pmoz.info

Rule Path
Disallow /

yak

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

omgili

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

pinterest

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

stremorbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider+

Rule Path
Disallow /

wonderbot/js 1.0

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

surdotlybot/1.0

Rule Path
Disallow /

spbot/5.0.3

Rule Path
Disallow /

yandex

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

bubing

Rule Path
Disallow /

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml
  • User-agent: Googlebot
  • Allow: /
  • Crawl-delay: 10