ipu-system.de
robots.txt

Robots Exclusion Standard data for ipu-system.de

Resource Scan

Scan Details

Site Domain ipu-system.de
Base Domain ipu-system.de
Scan Status Ok
Last Scan 2025-10-28T03:21:49+00:00
Next Scan 2025-11-27T03:21:49+00:00

Last Scan

Scanned 2025-10-28T03:21:49+00:00
URL https://ipu-system.de/robots.txt
Redirect https://www.ipu-system.de/robots.txt
Redirect Domain www.ipu-system.de
Redirect Base ipu-system.de
Domain IPs 185.228.136.112, 2a03:4000:23:316::1
Redirect IPs 185.228.136.112, 2a03:4000:23:316::1
Response IP 185.228.136.112
Found Yes
Hash 7c8f6cfd81afd0809ace88c5f30be848283f5b131fccaae32be34426e483f91e
SimHash 76d44b41c645

Groups

*

Rule Path
Disallow /icon/
Disallow /images/
Disallow /cgi-bin/

stress-agent

Rule Path
Disallow /

ahrefsbot
ai2bot
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
bitsightbot
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
diffbot
duckassistbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
google-cloudvertexbot
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
imgproxy
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
mistralai-user/1.0
novaact
oai-searchbot
omgili
omgilibot
operator
pangubot
perplexity-user
perplexitybot
petalbot
qualifiedbot
scrapy
semrushbot/7~bl
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
tiktokspider
timpibot
velenpublicwebcrawler
webzio-extended
wpbot
youbot
headlesschrome

Rule Path
Disallow /
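
The three groups above can be checked with Python's standard urllib.robotparser. The following is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the groups in this report, only three of the listed crawler agents are included for brevity, and the request paths (/index.html, /images/logo.png, /cgi-bin/test) are hypothetical examples.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a reconstruction of the rules from the groups listed in this
# report, not the file as served. Only three of the blocked agents are shown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /icon/
Disallow: /images/
Disallow: /cgi-bin/

User-agent: stress-agent
Disallow: /

User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /
"""

# Host taken from the redirect target in the scan details.
BASE = "https://www.ipu-system.de"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Hypothetical agent/path pairs used only for illustration.
    for agent, path in [
        ("Googlebot", "/index.html"),
        ("Googlebot", "/images/logo.png"),
        ("GPTBot", "/index.html"),
        ("stress-agent", "/"),
    ]:
        print(agent, path, allowed(agent, path))
```

An agent that is not named in any group falls back to the * group, so it is blocked only under /icon/, /images/, and /cgi-bin/, while the named agents are blocked from the whole site. Note that urllib.robotparser matches agent names case-insensitively by substring, which may differ in detail from how individual crawlers interpret the file.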

Comments

  • exclude help system from robots
  • disallow stress test
  • AI bots