davidkassan.com
robots.txt

Robots Exclusion Standard data for davidkassan.com

Resource Scan

Scan Details

Site Domain davidkassan.com
Base Domain davidkassan.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-18T20:20:04+00:00
Next Scan 2024-12-17T20:20:04+00:00

Last Successful Scan

Scanned2024-01-30T20:17:03+00:00
URL https://www.davidkassan.com/robots.txt
Domain IPs 162.159.130.90, 162.159.133.90
Response IP 162.159.133.90
Found Yes
Hash b526e37588f8754de06448ec84893e7efd026b26d6f2e12e05875d650ff9fe53
SimHash ab3ec814499b

Groups

*

Rule Path
Disallow /admin/
Disallow /mobile/viewer/
Disallow /*cloudflare.js
Disallow /bt/
Disallow /home/
Disallow /*.php
Disallow /genericdb/

mediapartners-google*

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

scoutjet

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

swebot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

bender

Rule Path
Disallow /

discobot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

searchwebengine.net

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

sindicebot

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

findfiles.net

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

goodzer

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

lemurwebcrawler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

edisterbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

pagepeeker

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.davidkassan.com/sitemap.xml

Comments

  • Disallowed stuff
  • All other robots will spider the domain