freitag.de
robots.txt

Robots Exclusion Standard data for freitag.de

Resource Scan

Scan Details

Site Domain freitag.de
Base Domain freitag.de
Scan Status Ok
Last Scan2024-09-20T07:31:24+00:00
Next Scan 2024-09-27T07:31:24+00:00

Last Scan

Scanned2024-09-20T07:31:24+00:00
URL https://freitag.de/robots.txt
Redirect https://www.freitag.de/robots.txt
Redirect Domain www.freitag.de
Redirect Base freitag.de
Domain IPs 185.105.252.15, 2a02:248:101:62::1286
Redirect IPs 185.105.252.15, 2a02:248:101:62::1286
Response IP 185.105.252.15
Found Yes
Hash b3c3c4d734ff7096dbab21171323a94c8794fdc93e199345c170fc66eb690907
SimHash 769dd341d1a5

Groups

*

Rule Path
Disallow /acl_users/session/
Disallow /acl_users/credentials_cookie_auth/

googlebot

Rule Path
Disallow /*%40%40search*$
Disallow /acl_users/session/
Disallow /acl_users/credentials_cookie_auth/

bingbot

Rule Path
Disallow /*%40%40search*$
Disallow /acl_users/session/
Disallow /acl_users/credentials_cookie_auth/

Other Records

Field Value
crawl-delay 60

applebot/0.1

Rule Path
Disallow /*%40%40search*$

seznambot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot

Rule Path
Disallow /
Allow /$
Allow /ueber
Allow /redaktion
Allow /presse
Allow /partner
Allow /impressum
Allow /agb
Allow /faq

img2dataset

Rule Path
Disallow /

gigabot
msnbot
teoma
slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

pinterest/nutch-2.3

Rule Path
Disallow /*%40%40search*$

Other Records

Field Value
crawl-delay 60

seokicks-robot
ahrefsbot
keyword density/0.9
xenu's
xenu's link sleuth 1.1c

Rule Path
Disallow /

aipbot
alexibot
aqua_products
archive.org_bot
asterias
b2w/0.1
backdoorbot/1.0
becomebot
blowfish/1.0
bookmark search tool
botalot
botrighthere
builtbottough
bullseye/1.0
bunnyslippers
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dataforseobot
dittospyder
dotbot
emailcollector
emailsiphon
emailwolf
erocrawler
extractorpro
fairad client
fasterfox
flaming attackbot
foobot
gaisbot
getright/4.2
glonaad
harvest/1.5
hloader
httplib
httrack 3.0
humanlinks
img2dataset
infonavirobot
iron33/1.0.2
jennybot
kenjin spider
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lnspiderguy
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
moget
moget/2.1
mozilla/4.0 (compatible; bullseye; windows 95)
ms search
msiecrawler
netants
nicerspro
ocelli
offline explorer
openbot
openfind
openfind data gatherer
oracle ultra search
perman
propowerbot/2.14
prowebwalker
proximic
psbot
queryn metasearch
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
riddler
rma
searchpreview
semrushbot
sitesnagger
spankbot
spanner
speedy
squidbot
surveybot
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
turnitinbot/1.5
twiceler
um-fc
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcapture 2.0
webcopier
webcopier v.2.2
webcopier v3.2a
webenhancer
websauger
website quester
webster pro
webstripper
webzip
webzip/4.0
webzip/4.21
webzip/5.0
www-collector-e
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout

Product Comment
ms search This is Sharepoint Portal Server, not the MSN search engine, so we block it.
Rule Path
Disallow /

favicon
iconsurf

Rule Path
Disallow /favicon.ico

Other Records

Field Value
sitemap https://www.freitag.de/sitemap.xml

Comments

  • Bing supposedly also supports wildcards
  • Wingmen asked to block these two
  • LLM-Blocklist
  • Crawl-Delays
  • SEO tools
  • Blocklist

Warnings

  • 1 invalid line.