astra-j.de
robots.txt

Robots Exclusion Standard data for astra-j.de

Resource Scan

Scan Details

Site Domain astra-j.de
Base Domain astra-j.de
Scan Status Ok
Last Scan2024-06-27T22:43:15+00:00
Next Scan 2024-07-04T22:43:15+00:00

Last Scan

Scanned2024-06-27T22:43:15+00:00
URL https://astra-j.de/robots.txt
Redirect https://www.astra-j.de/robots.txt
Redirect Domain www.astra-j.de
Redirect Base astra-j.de
Domain IPs 104.21.33.129, 172.67.145.48
Redirect IPs 104.21.33.129, 172.67.145.48
Response IP 172.67.145.48
Found Yes
Hash 110390738f4dab2ee0a5af24f2c4d32a9603dc77ed65fcd0511c3391683a418a
SimHash 283edbf08443

Groups

boardreader

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

spinn3r

Rule Path
Disallow /
Disallow /members/
Disallow /online/
Disallow /recent-activiy/
Disallow /find-new/
Disallow /misc/
Disallow /hilfe/
Disallow /search/
Disallow /posts/
Disallow /tags/
Disallow /affiliate/
Disallow /account/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.astra-j.de/sitemap.php

Warnings

  • 2 invalid lines.