freitag.de
robots.txt
Robots Exclusion Standard data for freitag.de
Resource Scan
Scan Details
Site Domain | freitag.de |
Base Domain | freitag.de |
Scan Status | Ok |
Last Scan | 2024-09-20T07:31:24+00:00 |
Next Scan | 2024-09-27T07:31:24+00:00 |
Last Scan
Scanned | 2024-09-20T07:31:24+00:00 |
URL | https://freitag.de/robots.txt |
Redirect | https://www.freitag.de/robots.txt |
Redirect Domain | www.freitag.de |
Redirect Base | freitag.de |
Domain IPs | 185.105.252.15, 2a02:248:101:62::1286 |
Redirect IPs | 185.105.252.15, 2a02:248:101:62::1286 |
Response IP | 185.105.252.15 |
Found | Yes |
Hash | b3c3c4d734ff7096dbab21171323a94c8794fdc93e199345c170fc66eb690907 |
SimHash | 769dd341d1a5 |
Groups
*
Rule | Path |
---|---|
Disallow | /acl_users/session/ |
Disallow | /acl_users/credentials_cookie_auth/ |
googlebot
Rule | Path |
---|---|
Disallow | /*%40%40search*$ |
Disallow | /acl_users/session/ |
Disallow | /acl_users/credentials_cookie_auth/ |
bingbot
Rule | Path |
---|---|
Disallow | /*%40%40search*$ |
Disallow | /acl_users/session/ |
Disallow | /acl_users/credentials_cookie_auth/ |
Other Records
Field | Value |
---|---|
crawl-delay | 60 |
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot
Rule | Path |
---|---|
Disallow | / |
Allow | /$ |
Allow | /ueber |
Allow | /redaktion |
Allow | /presse |
Allow | /partner |
Allow | /impressum |
Allow | /agb |
Allow | /faq |
gigabot
msnbot
teoma
slurp
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 60 |
aipbot
alexibot
aqua_products
archive.org_bot
asterias
b2w/0.1
backdoorbot/1.0
becomebot
blowfish/1.0
bookmark search tool
botalot
botrighthere
builtbottough
bullseye/1.0
bunnyslippers
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dataforseobot
dittospyder
dotbot
emailcollector
emailsiphon
emailwolf
erocrawler
extractorpro
fairad client
fasterfox
flaming attackbot
foobot
gaisbot
getright/4.2
glonaad
harvest/1.5
hloader
httplib
httrack 3.0
humanlinks
img2dataset
infonavirobot
iron33/1.0.2
jennybot
kenjin spider
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lnspiderguy
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
moget
moget/2.1
mozilla/4.0 (compatible; bullseye; windows 95)
ms search
msiecrawler
netants
nicerspro
ocelli
offline explorer
openbot
openfind
openfind data gatherer
oracle ultra search
perman
propowerbot/2.14
prowebwalker
proximic
psbot
queryn metasearch
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
riddler
rma
searchpreview
semrushbot
sitesnagger
spankbot
spanner
speedy
squidbot
surveybot
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
turnitinbot/1.5
twiceler
um-fc
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcapture 2.0
webcopier
webcopier v.2.2
webcopier v3.2a
webenhancer
websauger
website quester
webster pro
webstripper
webzip
webzip/4.0
webzip/4.21
webzip/5.0
www-collector-e
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
Product | Comment |
---|---|
ms search | This is Sharepoint Portal Server, not the MSN search engine, so we block it. |
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.freitag.de/sitemap.xml |
Warnings
- 1 invalid line.
Comments