capmh.biomedcentral.com
robots.txt

Robots Exclusion Standard data for capmh.biomedcentral.com

Resource Scan

Scan Details

Site Domain capmh.biomedcentral.com
Base Domain biomedcentral.com
Scan Status Ok
Last Scan2025-08-31T08:12:07+00:00
Next Scan 2025-09-30T08:12:07+00:00

Last Scan

Scanned2025-08-31T08:12:07+00:00
URL https://capmh.biomedcentral.com/robots.txt
Domain IPs 151.101.0.95, 151.101.128.95, 151.101.192.95, 151.101.64.95
Response IP 199.232.44.95
Found Yes
Hash 11c5a148c8f1894d18e0854bdd156585ae12e4b3228414373181a5c3626e1212
SimHash 40ace950e0a4

Groups

*

Rule Path
Disallow /search
Disallow */1000$
Disallow /epdf/
Disallow tab%3D
Disallow /about/institutional-support/membership/
Disallow /about/membership/members/
Disallow /about/oa-funding-and-policy-support/
Disallow /*/*/*/sharedit
Disallow */platform/contextual*
Disallow */cas-redirect/*
Disallow *.ris
Disallow */articles/*/*/figures/*
Disallow */articles/*/*/metrics/*
Disallow */articles/*/*/peer-review/*
Disallow */articles/*/*/comments/*
Disallow /placeholder/v1/membership/message

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://capmh.biomedcentral.com/sitemap.xml