berliner-kurier.de
robots.txt

Robots Exclusion Standard data for berliner-kurier.de

Resource Scan

Scan Details

Site Domain berliner-kurier.de
Base Domain berliner-kurier.de
Scan Status Ok
Last Scan2024-06-15T05:20:59+00:00
Next Scan 2024-06-22T05:20:59+00:00

Last Scan

Scanned2024-06-15T05:20:59+00:00
URL https://berliner-kurier.de/robots.txt
Redirect https://www.berliner-kurier.de/robots.txt
Redirect Domain www.berliner-kurier.de
Redirect Base berliner-kurier.de
Domain IPs 104.22.28.202, 104.22.29.202, 172.67.31.244, 2606:4700:10::6816:1cca, 2606:4700:10::6816:1dca, 2606:4700:10::ac43:1ff4
Redirect IPs 104.22.28.202, 104.22.29.202, 172.67.31.244, 2606:4700:10::6816:1cca, 2606:4700:10::6816:1dca, 2606:4700:10::ac43:1ff4
Response IP 104.22.28.202
Found Yes
Hash a8e78c7d62b37d56b5ac1170c5bf8218cccbd46712ff6282581acdca5e087b78
SimHash 010cca508153

Groups

echoboxbot

Rule Path
Allow /

mozilla/5.0 (compatible; ogdwctxcrawler)

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /suche?q=
Disallow /jos-data
Disallow /api/reactions
Disallow /api/profile
Disallow /*.woff2$
Disallow /*.woff$
Disallow /*.ttf$

Other Records

Field Value
sitemap https://www.berliner-kurier.de/feed.xml
sitemap https://www.berliner-kurier.de/sitemap.xml
sitemap https://www.berliner-kurier.de/news-sitemap.xml