amerikahaus.de
robots.txt

Robots Exclusion Standard data for amerikahaus.de

Resource Scan

Scan Details

Site Domain amerikahaus.de
Base Domain amerikahaus.de
Scan Status Ok
Last Scan 2026-02-09T19:42:44+00:00
Next Scan 2026-03-11T19:42:44+00:00

Last Scan

Scanned 2026-02-09T19:42:44+00:00
URL https://amerikahaus.de/robots.txt
Redirect https://www.amerikahaus.de/robots.txt
Redirect Domain www.amerikahaus.de
Redirect Base amerikahaus.de
Domain IPs 176.52.244.20
Redirect IPs 176.52.244.20
Response IP 176.52.244.20
Found Yes
Hash 4eb8960b7e57aea3606db42952de9cbf8ec5343280148a44391ed9f592c7e91b
SimHash 76944b02c3c6

Groups

*

Rule Path
Disallow /*.ai$
Disallow /*.arw$
Disallow /*.cdr$
Disallow /*.cr2$
Disallow /*.dib$
Disallow /*.eps$
Disallow /*.heif$
Disallow /*.heic$
Disallow /*.ind$
Disallow /*.indd$
Disallow /*.indt$
Disallow /*.j2k$
Disallow /*.jif$
Disallow /*.jfif$
Disallow /*.jfi$
Disallow /*.jp2$
Disallow /*.jpe$
Disallow /*.jpf$
Disallow /*.jpx$
Disallow /*.jpm$
Disallow /*.k25$
Disallow /*.mj2$
Disallow /*.nrw$
Disallow /*.pdf$
Disallow /*.psd$
Disallow /*.raw$
Disallow /*.svgz$
Disallow /*.tga$
Disallow /*.tif$
Disallow /*.webp$
Disallow /fileadmin/
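
The path patterns above use the two wildcard characters common in robots.txt files: `*` for any run of characters and a trailing `$` to anchor the end of the URL path. A rule without `$` acts as a prefix match. The following is a minimal sketch of how such patterns could be evaluated, assuming those semantics; the function names `rule_to_regex` and `is_disallowed` are hypothetical and are not taken from the scan or from any particular crawler.

```python
import re
from typing import Iterable


def rule_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, a trailing
    '$' anchors the end of the path, and anything else is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, patterns: Iterable[str]) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(p).match(path) for p in patterns)


# A few of the rules listed in the '*' group of this report.
RULES = ["/*.pdf$", "/*.tif$", "/fileadmin/"]

print(is_disallowed("/docs/report.pdf", RULES))       # anchored extension rule
print(is_disallowed("/docs/report.pdf?x=1", RULES))   # '$' stops the match here
print(is_disallowed("/fileadmin/user/a.jpg", RULES))  # prefix rule
```

Under these assumptions, `/docs/report.pdf` is blocked while `/docs/report.pdf?x=1` is not, because the `$` anchor requires the path to end in `.pdf`. Everything under `/fileadmin/` is blocked regardless of extension.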

ai2bot
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
diffbot
duckassistbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
imgproxy
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
novaact
oai-searchbot
omgili
omgilibot
operator
pangubot
perplexity-user
perplexitybot
petalbot
scrapy
sentibot
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
tiktokspider
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /
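
The report lists two groups: a `*` group with file-type and directory rules, and a group of named crawlers that are disallowed from the whole site. The sketch below illustrates how a crawler might pick the group that applies to it, assuming a case-insensitive exact match on the product token with `*` as the fallback. The name `rules_for` and the token `examplebot` are hypothetical, and the group contents are abbreviated from the lists in this report. As a simplification, the first matching named group wins; rules from multiple matching groups are not merged.

```python
from typing import List, Sequence, Tuple

Group = Tuple[Sequence[str], List[str]]

# Abbreviated from the report: (user-agent names, Disallow paths).
GROUPS: List[Group] = [
    (["*"], ["/*.pdf$", "/fileadmin/"]),
    (["gptbot", "ccbot", "claudebot", "perplexitybot"], ["/"]),
]


def rules_for(product_token: str, groups: Sequence[Group]) -> List[str]:
    """Return the Disallow paths of the group that applies to a crawler.

    A named group takes precedence over the '*' group; if no group
    matches at all, an empty list (nothing disallowed) is returned.
    """
    token = product_token.lower()
    fallback: List[str] = []
    for agents, rules in groups:
        names = [a.lower() for a in agents]
        if token in names:
            return rules
        if "*" in names:
            fallback = rules
    return fallback


print(rules_for("GPTBot", GROUPS))      # named group: whole site disallowed
print(rules_for("examplebot", GROUPS))  # falls back to the '*' group
```

With this selection, a crawler named in the second group sees only `Disallow /`, while any crawler not listed falls back to the `*` group and its file-type and `/fileadmin/` rules.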

Comments

  • www.robotstxt.org/
  • Allow crawling of all content