kleinanzeigen.deine-tierwelt.de
robots.txt

Robots Exclusion Standard data for kleinanzeigen.deine-tierwelt.de

Resource Scan

Scan Details

Site Domain kleinanzeigen.deine-tierwelt.de
Base Domain deine-tierwelt.de
Scan Status Ok
Last Scan2024-09-27T06:25:05+00:00
Next Scan 2024-10-11T06:25:05+00:00

Last Scan

Scanned2024-09-27T06:25:05+00:00
URL https://kleinanzeigen.deine-tierwelt.de/robots.txt
Domain IPs 108.138.26.120, 108.138.26.44, 108.138.26.46, 108.138.26.81
Response IP 108.156.22.14
Found Yes
Hash 560bc897ae11a21404da60abc7c7595ebd37d8d7d974b99b5d9bbfe5a0e00bf7
SimHash 2940c3d3d7b5

Groups

*

Rule Path
Allow /ads.txt
Disallow /

googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
mediapartners-google
mediapartners (googlebot)
adsbot-google
feedfetcher-google
ia_archiver
bingbot
heritrix
msnbot
slurp
gumgum
teoma
ebay relevance ad crawler
ebay relevance ad crawler powered by contentdetection (www.mindup.de)
browsershots
twitterbot
termlabs
ahrefssiteaudit

Rule Path
Allow /
Disallow /anzeigenlesen_alt/
Disallow /aza/
Disallow /anzeige_aufgeben/
Disallow /booking/
Disallow /extra/trap/trap.php
Disallow /feed/
Disallow /js_menu/
Disallow /recherche_inkasso/
Disallow /tracking/
Disallow /ajax/

grapeshot

Rule Path
Disallow

Comments

  • robots.txt fuer www.dhd24.com
  • [siehe http://selfhtml.teamone.de/diverses/robots.htm]
  • Allgemein
  • bestimmte Bots freigeben