jasonhawkes.com
robots.txt

Robots Exclusion Standard data for jasonhawkes.com

Resource Scan

Scan Details

Site Domain jasonhawkes.com
Base Domain jasonhawkes.com
Scan Status Ok
Last Scan2025-10-08T09:52:55+00:00
Next Scan 2025-11-07T09:52:55+00:00

Last Scan

Scanned2025-10-08T09:52:55+00:00
URL https://www.jasonhawkes.com/robots.txt
Domain IPs 54.36.204.21, 91.134.231.21
Response IP 91.134.231.21
Found Yes
Hash eb35a88730880fe4386904313143084d5938b0f5223736abe428f8bbfe8113a4
SimHash 730e826a8dc6

Groups

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

googleother-image

Rule Path
Disallow /

googleother-video

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

sidetrade indexer bot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

youbot

Rule Path
Disallow /

*

Rule Path
Disallow *search%3D*
Disallow *.rss
Disallow /*?r=1
Disallow /*?fis=*
Disallow /*?subgallery=*
Disallow /lightbox
Disallow /lightbox?*
Disallow /cart
Disallow /cart?*
Disallow /quotations/*
Disallow /users/*
Disallow /downloads/*
Disallow /invoices/*
Disallow /media/*/price
Disallow /media/*/price/*
Disallow /media/*/share
Disallow /media/*?download=*
Disallow /media/*/rate*rate%3D*
Disallow /-/*/medias/*/price
Disallow /-/*/medias/*/price/*
Disallow /-/*/medias/*/share
Disallow /-/*/medias/*?download=*
Disallow /-/*/medias/*/rate*rate%3D*
Disallow /m/lightbox
Disallow /m/lightbox?*
Disallow /m/cart
Disallow /m/cart?*
Disallow /m/quotations/*
Disallow /m/users/*
Disallow /m/downloads/*
Disallow /m/invoices/*
Disallow /m/media/*/price
Disallow /m/media/*/price/*
Disallow /m/media/*/share
Disallow /m/media/*?download=*
Disallow /m/media/*/rate*rate%3D*
Disallow /m/-/*/medias/*/price
Disallow /m/-/*/medias/*/price/*
Disallow /m/-/*/medias/*/share
Disallow /m/-/*/medias/*?download=*
Disallow /m/-/*/medias/*/rate*rate%3D*

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://www.jasonhawkes.com/sitemap.xml