opensourcemusings.com
robots.txt

Robots Exclusion Standard data for opensourcemusings.com

Resource Scan

Scan Details

Site Domain opensourcemusings.com
Base Domain opensourcemusings.com
Scan Status Ok
Last Scan2025-10-10T14:17:07+00:00
Next Scan 2025-10-11T14:17:07+00:00

Last Scan

Scanned2025-10-10T14:17:07+00:00
URL https://opensourcemusings.com/robots.txt
Domain IPs 35.185.44.232
Response IP 35.185.44.232
Found Yes
Hash 920a84a3f08d46e824ea3e9bfc577be45a97399c74c5f7e6ce01789aa3c02566
SimHash 75924900e6e7

Groups

*

Rule Path
Disallow /images/

Other Records

Field Value
crawl-delay 3

*

Rule Path
Disallow /collateral/

Other Records

Field Value
crawl-delay 3

baidulink

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

bingbot-image

Rule Path
Disallow /

bingbot-mobile

Rule Path
Disallow /

bingbot-news

Rule Path
Disallow /

bingbot-video

Rule Path
Disallow /

bingpreview

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /

duckduckgobot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /

googlebot-video

Rule Path
Disallow /

mail.ru bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

openai gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

you.com bot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

commoncrawler

Rule Path
Disallow /

httrack

Rule Path
Disallow /

megauploadbot

Rule Path
Disallow /

netcraftsurveyagent

Rule Path
Disallow /

wget

Rule Path
Disallow /

cocoonbot

Rule Path
Disallow /

datanyzebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

favicon checker

Rule Path
Disallow /

googleother

Rule Path
Disallow /

nutch

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

linkedinbot

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

quorabot

Rule Path
Disallow /

slackbot

Rule Path
Disallow /

slackbot-linkexpanding

Rule Path
Disallow /

tumblrbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

whatsappbot

Rule Path
Disallow /

spider

Rule Path
Disallow /

unknownbot

Rule Path
Disallow /

Comments

  • generated by aisearchwatch.com/t/protect-from-ai
  • Search engine crawlers
  • AI bots
  • Archiving/data collection bots
  • Data extraction/analysis bots
  • Image analysis bots
  • SEO/marketing analytics bots
  • Social media integration bots
  • Misc bots
  • generated by aisearchwatch.com/t/protect-from-ai