pravda24.cz
robots.txt

Robots Exclusion Standard data for pravda24.cz

Resource Scan

Scan Details

Site Domain pravda24.cz
Base Domain pravda24.cz
Scan Status Ok
Last Scan2024-05-22T08:29:17+00:00
Next Scan 2024-05-29T08:29:17+00:00

Last Scan

Scanned2024-05-22T08:29:17+00:00
URL https://pravda24.cz/robots.txt
Domain IPs 2a0e:acc0::c27, 2a0e:acc0::c28, 45.138.107.27, 45.138.107.28
Response IP 45.138.107.28
Found Yes
Hash e8e33f23596b1036ee91c78a304c5e70716faa9ebb2f063ac2eaf68f624345b0
SimHash 6c2d4f455c87

Groups

seznambot
googlebot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://pravda24.cz/sitemap.xml

Comments

  • pro SeznamBota: neprohledávat /cz/chat/, rychlost 10 URL za minutu
  • pro Googlebota: neprohledávat /logs/, rychlost 10 URL za minutu

Warnings

  • `request-rate` is not a known field.