energie-wissen.info
robots.txt

Robots Exclusion Standard data for energie-wissen.info

Resource Scan

Scan Details

Site Domain energie-wissen.info
Base Domain energie-wissen.info
Scan Status Ok
Last Scan 2024-09-30T10:15:44+00:00
Next Scan 2024-10-07T10:15:44+00:00

Last Scan

Scanned 2024-09-30T10:15:44+00:00
URL https://energie-wissen.info/robots.txt
Redirect https://www.energie-wissen.info/robots.txt
Redirect Domain www.energie-wissen.info
Redirect Base energie-wissen.info
Domain IPs 94.102.220.193
Redirect IPs 94.102.220.193
Response IP 94.102.220.193
Found Yes
Hash 904ff929a7e12fa2a78f331f7db8743c7aa56fab4f6e8ea37cf48e9956889886
SimHash d05a49b3c0a6

Groups

googlebot-image

Rule Path
Disallow /youtube/

aboundexbot
ahrefsbot
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
archive.org_bot
backlinkcrawler
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cliqzbot
cohere-ai
dataprovider
diffbot
domaincrawler
dotbot
easouspider
ec2linkfinder
exabot
ezooms
facebookbot
facebookexternalhit
fetch
friendlycrawler
genieo
go-http-client/2.0
gptbot
grub-client
httrack
ia_archiver
ia_archiver/1.6
ia_archiver-web.archive.org
icc-crawler
imagesiftbot
img2dataset
infopath
infopath.2
ip-web-crawler.com
libwww
linkpadbot
mail.ru
meanpathbot
meta-externalagent
meta-externalfetcher
microsoft.url.control
mj12bot
mozilla/4.0
msiecrawler
netestate ne crawler
npbot
oai-searchbot
offline explorer
omgili
omgilibot
panscient.com
perplexitybot
psbot
scrapy
screaming frog seo spider
searchmetericsbot
searchspider
semrushbot
seokicks-robot
sitebot
sitecheck.internetseer.com
sitesnagger
sosospider
spbot
swebot
taptubot
teleport
teleportpro
timpibot
turnitinbot
twengabot
twiceler
ubicrawler
velenpublicwebcrawler
vscooter
wbsearchbot
webcapture
webcopier
webreaper
webstripper
webzip
wget
wotbox
xenu
xenu's
xenu's link sleuth 1.1c
yandex
youbot
zealbot

Rule Path
Disallow /
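A minimal sketch of how the two groups above could be checked with Python's standard `urllib.robotparser`. The robots.txt text here is an abbreviated reconstruction from this report (only three of the blocked agents are listed), and the `allowed` helper, the `/artikel/` path, and the `examplebot` agent are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the first two groups in the report.
# The real file lists many more user agents in the second group.
ROBOTS_TXT = """\
User-agent: googlebot-image
Disallow: /youtube/

User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
Disallow: /
"""

BASE = "https://www.energie-wissen.info"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


# Agents in the blanket-disallow group are blocked from every path.
print(allowed("gptbot", "/"))                       # False
print(allowed("claudebot", "/artikel/"))            # False

# googlebot-image is only kept out of /youtube/.
print(allowed("googlebot-image", "/youtube/clip"))  # False
print(allowed("googlebot-image", "/artikel/"))      # True

# An agent with no matching group is allowed in this reduced example,
# because the "*" group was deliberately left out of ROBOTS_TXT.
print(allowed("examplebot", "/artikel/"))           # True
```

The standard-library parser matches paths by plain prefix and applies the first matching rule, so it is suited to simple groups like these rather than to wildcard patterns.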

*

Rule Path
Disallow *%26preview%3D*
Disallow *?s=*
Disallow /?s=
Disallow /comments/
Disallow */comments/
Disallow /feed/
Disallow */feed/
Disallow /rss/
Disallow */rss/
Disallow /trackback/
Disallow */trackback/
Disallow /cgi-bin/
Disallow /logs/
Disallow /usage/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /webalizer/
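The `*` group mixes wildcard patterns with an `Allow` exception, which a plain prefix matcher does not handle. The sketch below assumes the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and `Allow` wins a tie); individual crawlers may behave differently. The function names and the sample paths are hypothetical, only a subset of the rules is included, and percent-encoding normalization is left out for brevity.

```python
import re

# Subset of the "*" group from the report, as (rule, path pattern) pairs.
RULES = [
    ("disallow", "*%26preview%3D*"),
    ("disallow", "*?s=*"),
    ("disallow", "/?s="),
    ("disallow", "/feed/"),
    ("disallow", "*/feed/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/webalizer/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern ('*' wildcard, optional
    trailing '$' anchor) into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match = allowed."""
    best_kind, best_len = "allow", -1
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_allow = len(pattern) == best_len and kind == "allow"
            if longer or tie_allow:
                best_kind, best_len = kind, len(pattern)
    return best_kind == "allow"


print(is_allowed("/wp-admin/admin-ajax.php"))  # True: the longer Allow wins
print(is_allowed("/wp-admin/options.php"))     # False
print(is_allowed("/thema/solar/feed/"))        # False: matches */feed/
print(is_allowed("/?s=solar"))                 # False
print(is_allowed("/thema/solar/"))             # True: no rule matches
```

Note that a first-match parser would block `/wp-admin/admin-ajax.php` here, because `Disallow /wp-admin/` precedes the `Allow` line; under longest-match evaluation the order of the rules does not matter.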

Comments

  • Images
  • Spiders
  • General
  • Sitemap: https://www.example.com/sitemap.xml