ceskaplaneta.net
robots.txt

Robots Exclusion Standard data for ceskaplaneta.net

Resource Scan

Scan Details

Site Domain ceskaplaneta.net
Base Domain ceskaplaneta.net
Scan Status Ok
Last Scan 2024-11-11T14:29:21+00:00
Next Scan 2024-11-18T14:29:21+00:00

Last Scan

Scanned 2024-11-11T14:29:21+00:00
URL https://ceskaplaneta.net/robots.txt
Domain IPs 104.21.64.116, 172.67.183.173, 2606:4700:3030::ac43:b7ad, 2606:4700:3032::6815:4074
Response IP 172.67.183.173
Found Yes
Hash d6183617405c2f88267710bcaa7ccb53464d759a2b29d67a226f8967fee4de24
SimHash e2492cc1ef90
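
The 64-hex-digit Hash value above has the length of a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, a fingerprint of a fetched robots.txt body could be computed as follows and compared between scans to detect changes. The function name `content_fingerprint` is hypothetical and not part of the scanner.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is a SHA-256 digest of the raw
    response body. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a newly fetched body."""
    return content_fingerprint(body) != previous_digest
```

An exact digest changes on any byte-level difference, which is presumably why the report also lists a separate SimHash for near-duplicate comparison.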

Groups

*

Rule Path
Disallow */search?query=*
Disallow *?p=*
Disallow *?s=*
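
The three Disallow paths above rely on `*` wildcards. The sketch below shows one way such patterns can be evaluated against a URL path plus query string. It assumes the common convention that `*` matches any run of characters and that a trailing `$` anchors the end of the URL; the helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and individual crawlers may interpret the patterns differently.

```python
import re

# Disallow patterns copied from the `*` group in this report.
DISALLOW_PATTERNS = ["*/search?query=*", "*?p=*", "*?s=*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regular expression.

    Assumption: `*` matches any run of characters and a trailing `$`
    anchors the end of the URL; every other character is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(
        pattern_to_regex(p).match(path_and_query) is not None
        for p in DISALLOW_PATTERNS
    )
```

Under these assumptions, `is_disallowed("/search?query=praha")` and `is_disallowed("/?s=test")` are true, while `is_disallowed("/clanek?page=2")` is false because `?page=` does not contain the literal `?p=`. A parameter that is not first in the query string, as in `/x?a=1&p=2`, is also not matched, since the patterns require `?` immediately before the parameter name.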

Other Records

Field Value
sitemap https://ceskaplaneta.net/sitemap.xml
sitemap https://ceskaplaneta.net/feed/googlenews/googlenews.xml

Warnings

  • `host` is not a known field.