thomasreiner.pro
robots.txt

Robots Exclusion Standard data for thomasreiner.pro

Resource Scan

Scan Details

Site Domain thomasreiner.pro
Base Domain thomasreiner.pro
Scan Status Ok
Last Scan2025-10-22T15:32:35+00:00
Next Scan 2025-10-29T15:32:35+00:00

Last Scan

Scanned2025-10-22T15:32:35+00:00
URL https://www.thomasreiner.pro/robots.txt
Domain IPs 54.36.204.21, 91.134.231.21
Response IP 91.134.231.21
Found Yes
Hash 99c333adb56a274d3bf46222725da4ecc31d4efa434a4e761d1177353bc0b54b
SimHash 738e8918ccc2

Groups

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

googleother-image

Rule Path
Disallow /

googleother-video

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

sidetrade indexer bot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

youbot

Rule Path
Disallow /

*

Rule Path
Disallow *search%3D*
Disallow *.rss
Disallow /*?r=1
Disallow /*?fis=*
Disallow /*?subgallery=*
Disallow /lightbox
Disallow /lightbox?*
Disallow /cart
Disallow /cart?*
Disallow /quotations/*
Disallow /users/*
Disallow /downloads/*
Disallow /invoices/*
Disallow /media/*/price
Disallow /media/*/price/*
Disallow /media/*/share
Disallow /media/*?download=*
Disallow /media/*/rate*rate%3D*
Disallow /-/*/medias/*/price
Disallow /-/*/medias/*/price/*
Disallow /-/*/medias/*/share
Disallow /-/*/medias/*?download=*
Disallow /-/*/medias/*/rate*rate%3D*

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://www.thomasreiner.pro/sitemap.xml