suchmaschinen-datenbank.at
robots.txt

Robots Exclusion Standard data for suchmaschinen-datenbank.at

Resource Scan

Scan Details

Site Domain suchmaschinen-datenbank.at
Base Domain suchmaschinen-datenbank.at
Scan Status Ok
Last Scan2024-10-09T18:01:41+00:00
Next Scan 2024-10-16T18:01:41+00:00

Last Scan

Scanned2024-10-09T18:01:41+00:00
URL https://suchmaschinen-datenbank.at/robots.txt
Redirect https://www.suchmaschinen-datenbank.at/robots.txt
Redirect Domain www.suchmaschinen-datenbank.at
Redirect Base suchmaschinen-datenbank.at
Domain IPs 94.102.220.119
Redirect IPs 94.102.220.119
Response IP 94.102.220.119
Found Yes
Hash ba961b4c45f43cd7ba3219d1cfa93757f1204a7017d12a8cf56e0f8da10f9aef
SimHash d01b69a3c0a2

Groups

aboundexbot
ahrefsbot
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
archive.org_bot
backlinkcrawler
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cliqzbot
cohere-ai
dataprovider
diffbot
domaincrawler
dotbot
easouspider
ec2linkfinder
exabot
ezooms
facebookbot
facebookexternalhit
fetch
friendlycrawler
genieo
go-http-client/2.0
gptbot
grub-client
httrack
ia_archiver
ia_archiver/1.6
ia_archiver-web.archive.org
icc-crawler
imagesiftbot
img2dataset
infopath
infopath.2
ip-web-crawler.com
libwww
linkpadbot
mail.ru
meanpathbot
meta-externalagent
meta-externalfetcher
microsoft.url.control
mj12bot
mozilla/4.0
msiecrawler
netestate ne crawler
npbot
oai-searchbot
offline explorer
omgili
omgilibot
panscient.com
perplexitybot
psbot
scrapy
screaming frog seo spider
searchmetericsbot
searchspider
semrushbot
seokicks-robot
sitebot
sitecheck.internetseer.com
sitesnagger
sosospider
spbot
swebot
taptubot
teleport
teleportpro
timpibot
turnitinbot
twengabot
twiceler
ubicrawler
velenpublicwebcrawler
vscooter
wbsearchbot
webcapture
webcopier
webreaper
webstripper
webzip
wget
wotbox
xenu
xenu's
xenu's link sleuth 1.1c
yandex
youbot
zealbot

Rule Path
Disallow /

*

Rule Path
Disallow *%26preview%3D*
Disallow *?s=*
Disallow /?s=
Disallow /comments/
Disallow */comments/
Disallow /feed/
Disallow */feed/
Disallow /rss/
Disallow */rss/
Disallow /trackback/
Disallow */trackback/
Disallow /cgi-bin/
Disallow /logs/
Disallow /youtube/
Disallow /usage/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /webalizer/
Disallow /name/
Allow /wp-content/dateien/
Allow /wp-content/themes/sd-theme/

Comments

  • Scraper
  • Generell