astrah.pl
robots.txt

Robots Exclusion Standard data for astrah.pl

Resource Scan

Scan Details

Site Domain astrah.pl
Base Domain astrah.pl
Scan Status Ok
Last Scan2024-09-24T23:14:34+00:00
Next Scan 2024-10-01T23:14:34+00:00

Last Scan

Scanned2024-09-24T23:14:34+00:00
URL https://astrah.pl/robots.txt
Redirect https://www.astrah.pl/robots.txt
Redirect Domain www.astrah.pl
Redirect Base astrah.pl
Domain IPs 185.110.51.100
Redirect IPs 185.110.51.100
Response IP 185.110.51.100
Found Yes
Hash cdfebb012838a1700d753d1d346e65b84cefffd3ddca703822aa301af04ed5d2
SimHash 94185d59c5f3

Groups

urlmetriken

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fast

Rule Path
Disallow /

Comments

  • advertising-related bots:
  • Crawlers that are kind enough to obey, but which we'd rather not have
  • unless they're feeding search engines.
  • Some bots are known to be trouble, particularly those designed to copy
  • entire sites. Please obey robots.txt.
  • Misbehaving: requests much too fast: