kleinanzeigen.deine-tierwelt.de
robots.txt

Robots Exclusion Standard data for kleinanzeigen.deine-tierwelt.de

Resource Scan

Scan Details

Site Domain kleinanzeigen.deine-tierwelt.de
Base Domain deine-tierwelt.de
Scan Status Ok
Last Scan2024-11-08T06:40:28+00:00
Next Scan 2024-11-22T06:40:28+00:00

Last Scan

Scanned2024-11-08T06:40:28+00:00
URL https://kleinanzeigen.deine-tierwelt.de/robots.txt
Domain IPs 3.160.212.12, 3.160.212.122, 3.160.212.31, 3.160.212.43
Response IP 13.226.2.61
Found Yes
Hash 560bc897ae11a21404da60abc7c7595ebd37d8d7d974b99b5d9bbfe5a0e00bf7
SimHash 2940c3d3d7b5

Groups

*

Rule Path
Allow /ads.txt
Disallow /

googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
mediapartners-google
mediapartners (googlebot)
adsbot-google
feedfetcher-google
ia_archiver
bingbot
heritrix
msnbot
slurp
gumgum
teoma
ebay relevance ad crawler
ebay relevance ad crawler powered by contentdetection (www.mindup.de)
browsershots
twitterbot
termlabs
ahrefssiteaudit

Rule Path
Allow /
Disallow /anzeigenlesen_alt/
Disallow /aza/
Disallow /anzeige_aufgeben/
Disallow /booking/
Disallow /extra/trap/trap.php
Disallow /feed/
Disallow /js_menu/
Disallow /recherche_inkasso/
Disallow /tracking/
Disallow /ajax/

grapeshot

Rule Path
Disallow

Comments

  • robots.txt fuer www.dhd24.com
  • [siehe http://selfhtml.teamone.de/diverses/robots.htm]
  • Allgemein
  • bestimmte Bots freigeben