wbur.org
robots.txt

Robots Exclusion Standard data for wbur.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	wbur.org
Base Domain	wbur.org
Scan Status	Ok
Last Scan	2024-11-13T10:55:40+00:00
Next Scan	2024-11-20T10:55:40+00:00

Last Scan

Scanned	2024-11-13T10:55:40+00:00
URL	https://wbur.org/robots.txt
Redirect	https://www.wbur.org/robots.txt
Redirect Domain	www.wbur.org
Redirect Base	wbur.org
Domain IPs	54.235.254.104
Redirect IPs	54.235.254.104
Response IP	54.235.254.104
Found	Yes
Hash	f7a8c1563fa4b392bee5a14d552d84fed43c192b9500eef3a8bf5300529db12d
SimHash	5c1099d0a2b1

Groups

*

Rule	Path
Allow	/
Disallow	/*/archive/
Disallow	/circle-round-club*
Disallow	/radio/programs/hereandnow/tag/
Disallow	/radio/programs/hereandnow/topic/
Disallow	/radio/programs/onpoint/tag/
Disallow	/radio/programs/onpoint/topic/
Disallow	/opinion/section/cognoscenti/tag/
Disallow	/opinion/section/cognoscenti/topic/

Rule

Path

Allow

/

Disallow

/*/archive/

Disallow

/circle-round-club*

Disallow

/radio/programs/hereandnow/tag/

Disallow

/radio/programs/hereandnow/topic/

Disallow

/radio/programs/onpoint/tag/

Disallow

/radio/programs/onpoint/topic/

Disallow

/opinion/section/cognoscenti/tag/

Disallow

/opinion/section/cognoscenti/topic/

gptbot
amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
google-extended
imagesiftbot
img2dataset
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.wbur.org/sitemap.xml.gz
sitemap	https://www.wbur.org/sitemap-googlenews.xml.gz

Field

Value

sitemap

https://www.wbur.org/sitemap.xml.gz

sitemap

https://www.wbur.org/sitemap-googlenews.xml.gz

Back to top

Comments

Hello robot!
Block AI Crawlers

Back to top

wbur.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

wbur.org
robots.txt