tennis-weblog.de
robots.txt

Robots Exclusion Standard data for tennis-weblog.de

Resource Scan

Scan Details

Site Domain tennis-weblog.de
Base Domain tennis-weblog.de
Scan Status Ok
Last Scan 2024-10-09T18:07:08+00:00
Next Scan 2024-10-16T18:07:08+00:00

Last Scan

Scanned 2024-10-09T18:07:08+00:00
URL https://tennis-weblog.de/robots.txt
Redirect https://www.tennis-weblog.de/robots.txt
Redirect Domain www.tennis-weblog.de
Redirect Base tennis-weblog.de
Domain IPs 94.102.220.119
Redirect IPs 94.102.220.119
Response IP 94.102.220.119
Found Yes
Hash 4a199428a0dd2b0c0932d1679d246cf6dd4bd339cd22495f92c5c0b3a4100c3a
SimHash d05b59b3c0a0

Groups

aboundexbot
ahrefsbot
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
archive.org_bot
backlinkcrawler
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cliqzbot
cohere-ai
dataprovider
diffbot
domaincrawler
dotbot
easouspider
ec2linkfinder
exabot
ezooms
facebookbot
facebookexternalhit
fetch
friendlycrawler
genieo
go-http-client/2.0
gptbot
grub-client
httrack
ia_archiver
ia_archiver/1.6
ia_archiver-web.archive.org
icc-crawler
imagesiftbot
img2dataset
infopath
infopath.2
ip-web-crawler.com
libwww
linkpadbot
mail.ru
meanpathbot
meta-externalagent
meta-externalfetcher
microsoft.url.control
mj12bot
mozilla/4.0
msiecrawler
netestate ne crawler
npbot
oai-searchbot
offline explorer
omgili
omgilibot
panscient.com
perplexitybot
psbot
scrapy
screaming frog seo spider
searchmetericsbot
searchspider
semrushbot
seokicks-robot
sitebot
sitecheck.internetseer.com
sitesnagger
sosospider
spbot
swebot
taptubot
teleport
teleportpro
timpibot
turnitinbot
twengabot
twiceler
ubicrawler
velenpublicwebcrawler
vscooter
wbsearchbot
webcapture
webcopier
webreaper
webstripper
webzip
wget
wotbox
xenu
xenu's
xenu's link sleuth 1.1c
yandex
youbot
zealbot

Rule Path
Disallow /
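
The group above blocks every listed crawler from the whole site. As a rough sketch of how such a group behaves, the snippet below rebuilds a small robots.txt from a handful of the listed agents and checks it with Python's standard urllib.robotparser. The agent subset, the helper name build_group, and the test URLs are illustrative assumptions, not the site's actual file.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a small subset of the agents listed in the group above.
BLOCKED_AGENTS = ["ahrefsbot", "gptbot", "claudebot", "ccbot", "semrushbot"]


def build_group(agents, rules):
    """Return robots.txt lines for one group (hypothetical helper)."""
    lines = [f"User-agent: {agent}" for agent in agents]
    lines += [f"{kind}: {path}" for kind, path in rules]
    return lines


# One group that shuts out the listed crawlers, then a catch-all group
# with two of the plain prefix rules from the scan.
robots_lines = (
    build_group(BLOCKED_AGENTS, [("Disallow", "/")])
    + [""]
    + build_group(["*"], [("Disallow", "/feed/"), ("Disallow", "/cgi-bin/")])
)

parser = RobotFileParser()
parser.parse(robots_lines)

for agent, url in [
    ("GPTBot", "https://www.tennis-weblog.de/"),
    ("Googlebot", "https://www.tennis-weblog.de/feed/"),
    ("Googlebot", "https://www.tennis-weblog.de/2024/"),
]:
    print(agent, url, parser.can_fetch(agent, url))
```

Agent names are matched case-insensitively, so "GPTBot" falls under the "gptbot" entry. Crawlers that are not named in the first group fall through to the "*" group.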

*

Rule Path
Disallow *%26preview%3D*
Disallow *?s=*
Disallow /?s=
Disallow /comments/
Disallow */comments/
Disallow /feed/
Disallow */feed/
Disallow /rss/
Disallow */rss/
Disallow /trackback/
Disallow */trackback/
Disallow /cgi-bin/
Disallow /logs/
Disallow /usage/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /webalizer/
Disallow /wp-content/banners/
Disallow /logos/
Disallow /rating_
Disallow /youtube/
Disallow /widget/
Disallow /zu/
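
Several of the rules above use the * wildcard, and the Allow for /wp-admin/admin-ajax.php only takes effect under longest-match precedence. Python's urllib.robotparser uses plain prefix matching in file order, so the sketch below hand-rolls a matcher in the style of RFC 9309 (longest pattern wins, Allow wins a tie). The rule subset and the names pattern_to_regex and is_allowed are illustrative assumptions, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Assumption: a subset of the "*" group rules listed above.
RULES = [
    ("disallow", "*?s=*"),
    ("disallow", "/feed/"),
    ("disallow", "*/feed/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/rating_"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); larger tuple wins
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
             "/?s=federer", "/category/atp/feed/", "/2024/10/some-post/"]:
    print(path, is_allowed(path))
```

Comparing (length, is_allow) tuples keeps the precedence logic in one place: the longer pattern wins, and on equal length True sorts above False, so Allow takes the tie.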

Other Records

Field Value
sitemap https://www.tennis-weblog.de/sitemap.xml
sitemap https://www.tennis-weblog.de/google-news-sitemap.xml
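
The sitemap records above can be read back with the standard library as well. A minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the records appear in the file as ordinary "Sitemap:" lines:

```python
from urllib.robotparser import RobotFileParser

# Assumption: the two sitemap records as they would appear in robots.txt.
sitemap_lines = [
    "Sitemap: https://www.tennis-weblog.de/sitemap.xml",
    "Sitemap: https://www.tennis-weblog.de/google-news-sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(sitemap_lines)

# site_maps() returns the listed URLs in file order, or None if there are none.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```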

Comments

  • Scraper
  • General