buzzfeed.de
robots.txt

Robots Exclusion Standard data for buzzfeed.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	buzzfeed.de
Base Domain	buzzfeed.de
Scan Status	Ok
Last Scan	2024-10-28T07:25:53+00:00
Next Scan	2024-11-04T07:25:53+00:00

Last Scan

Scanned	2024-10-28T07:25:53+00:00
URL	https://buzzfeed.de/robots.txt
Domain IPs	91.234.30.113
Response IP	91.234.30.113
Found	Yes
Hash	a6ffb4b39aa5b9510c4015b83035b33cfb9f17d09bf4c20384dd3bd5be8be073
SimHash	73111358af25

Groups

*

Rule	Path
Disallow	/lightweight-ajax
Disallow	/*?trafficsource
Disallow	/suche/
Disallow	/*?cmp=defrss
Disallow	/test/
Disallow	/fdn/bootstrap/
Disallow	/bi/bootstrap/
Disallow	/bi/doop/
Disallow	/sso/

Rule

Path

Disallow

/lightweight-ajax

Disallow

/*?trafficsource

Disallow

/suche/

Disallow

/*?cmp=defrss

Disallow

/test/

Disallow

/fdn/bootstrap/

Disallow

/bi/bootstrap/

Disallow

/bi/doop/

Disallow

/sso/

xovi

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

searchmetricsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bingbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

gptbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

ccbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

msnbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

robots.txt www.buzzfeed.de
Legal notice: www.buzzfeed.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access www.buzzfeed.de or collect or mine data without the express permission of www.buzzfeed.de is strictly prohibited.

buzzfeed.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

xovi

sistrix

searchmetricsbot

bingbot

gptbot

ccbot

msnbot

Other Records

amazonbotanthropic-aiapplebot-extendedawariorssbotawariosmartbotbytespiderccbotchatgpt-userclaudebotclaude-webcohere-aidataforseobotfacebookbotgoogle-extendedimagesiftbotmagpie-crawleromgiliomgilibotpeer39_crawlerpeer39_crawler/1.0perplexitybotyoubot

Comments

buzzfeed.de
robots.txt

amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot