mycolemans.com
robots.txt

Robots Exclusion Standard data for mycolemans.com

Resource Scan

Scan Details

Site Domain mycolemans.com
Base Domain mycolemans.com
Scan Status Ok
Last Scan2025-08-03T02:52:35+00:00
Next Scan 2025-09-02T02:52:35+00:00

Last Scan

Scanned2025-08-03T02:52:35+00:00
URL https://mycolemans.com/robots.txt
Domain IPs 204.101.35.11
Response IP 204.101.35.11
Found Yes
Hash 47d5be96b37d3a3cd9fdef984e88b2748f3e396f12fe13900c8fa6e8ae7cbc99
SimHash 601cd9c0ec9b

Groups

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

fatbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://mycolemans.com/SiteMap.xml