lucyindelucht.nl
robots.txt

Robots Exclusion Standard data for lucyindelucht.nl

Resource Scan

Scan Details

Site Domain lucyindelucht.nl
Base Domain lucyindelucht.nl
Scan Status Ok
Last Scan 2024-10-24T03:08:36+00:00
Next Scan 2024-11-07T03:08:36+00:00

Last Scan

Scanned 2024-10-24T03:08:36+00:00
URL https://lucyindelucht.nl/robots.txt
Redirect https://www.lucyindelucht.nl/robots.txt
Redirect Domain www.lucyindelucht.nl
Redirect Base lucyindelucht.nl
Domain IPs 185.50.95.98, 2a00:c660:5126:2100::3
Redirect IPs 185.50.95.98, 2a00:c660:5126:2100::3
Response IP 185.50.95.98
Found Yes
Hash c12448315678edd2f873d47e3f22fb928e7d39502170f926014e8c2acb226441
SimHash 345fd14bc6b5

Groups

*

Rule Path
Disallow /ajax/
Disallow /api/
Disallow /CFIDE/
Disallow /includes/
Disallow /spanz/
Disallow /vacatures/direct-solliciteren/
Disallow /readme.html

Other Records

Field Value
crawl-delay 20

baiduspider

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

datacha0s

Rule Path
Disallow /

doc

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

npbot

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

synapse

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lucyindelucht.nl/sitemap.xml
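The groups and records listed above can be evaluated programmatically. What follows is a minimal sketch, assuming Python 3.8+ and its standard urllib.robotparser module. The robots.txt text in it is reconstructed from a subset of the parsed records in this report (the raw file is not reproduced here, so its exact layout is an assumption), and the helper name allowed and the agent name SomeBot are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the parsed records in this report; the real file's
# ordering, comments, and remaining bot groups are not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ajax/
Disallow: /api/
Disallow: /CFIDE/
Disallow: /includes/
Disallow: /spanz/
Disallow: /vacatures/direct-solliciteren/
Disallow: /readme.html
Crawl-delay: 20

User-agent: mj12bot
Disallow: /

Sitemap: https://www.lucyindelucht.nl/sitemap.xml
"""

BASE_URL = "https://www.lucyindelucht.nl"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on this site?"""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # A bot without its own group falls back to the "*" group.
    print(allowed("SomeBot", "/api/users"))
    print(allowed("SomeBot", "/vacatures/"))
    # A bot with its own group is blocked from the whole site.
    print(allowed("mj12bot", "/"))
    print(parser.crawl_delay("SomeBot"))
    print(parser.site_maps())
```

Note that urllib.robotparser matches rule paths by simple prefix and is case-sensitive, so /CFIDE/ and /cfide/ are treated as different paths. Other crawlers may interpret the same file differently.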

Comments

  • Spanz cms Dynamic Sitemap
  • All domains
  • Directories
  • Files
  • For specific bots (on all domains)
  • Hi! Trying to reverse engineer something?
  • Maybe you should come work with us.
  • Apply at www.tuesday.nl and mention this comment.