planetapolska.com
robots.txt

Robots Exclusion Standard data for planetapolska.com

Resource Scan

Scan Details

Site Domain planetapolska.com
Base Domain planetapolska.com
Scan Status Ok
Last Scan 2024-11-14T06:26:49+00:00
Next Scan 2024-11-21T06:26:49+00:00

Last Scan

Scanned 2024-11-14T06:26:49+00:00
URL https://planetapolska.com/robots.txt
Domain IPs 104.26.4.176, 104.26.5.176, 172.67.68.183, 2606:4700:20::681a:4b0, 2606:4700:20::681a:5b0, 2606:4700:20::ac43:44b7
Response IP 104.26.4.176
Found Yes
Hash aca58707777b4c4ff9d6f010b4493d26dd9c9bf482f1d7b66ebc87b541c841fd
SimHash 26098c82e790
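
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treat that as an assumption. Under that assumption, a minimal sketch for checking whether a freshly downloaded copy still matches the recorded value could look like this (the helper names `sha256_hex` and `matches_recorded` are hypothetical):

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "aca58707777b4c4ff9d6f010b4493d26dd9c9bf482f1d7b66ebc87b541c841fd"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Assumption: the report hashes the exact bytes of the robots.txt response."""
    return sha256_hex(body) == recorded.lower()
```

A mismatch would only mean the bytes differ from what was scanned (or that the scanner uses a different algorithm or normalization), not that the rules themselves changed.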

Groups

*

Rule Path
Disallow */search?query=*
Disallow *?p=*
Disallow *?s=*
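
To illustrate what the three Disallow rules cover, here is a small sketch of wildcard matching in the style described by RFC 9309, where `*` matches any run of characters and a trailing `$` anchors the end of the URL. It is not the scanner's own matcher, and the function names are hypothetical.

```python
import re

# Disallow rules for the "*" group, copied from the table above.
DISALLOW_RULES = ["*/search?query=*", "*?p=*", "*?s=*"]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regular expression."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and join them with ".*" in place of each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if the path (including query string) matches any rule."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    for url_path in ["/search?query=news", "/?p=123", "/news/article-1", "/page?id=5&p=2"]:
        print(url_path, is_disallowed(url_path))
```

Note that `*?p=*` only matches when `p` is the first query parameter, since the rule requires a literal `?` immediately before `p=`; a path such as `/page?id=5&p=2` is not matched by this sketch.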

Other Records

Field Value
sitemap https://planetapolska.com/sitemap.xml
sitemap https://planetapolska.com/feed/googlenews/googlenews.xml
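
The sitemap records can be read programmatically with Python's standard `urllib.robotparser`. The sketch below parses a robots.txt text reconstructed from this report (not the original file, whose exact layout and `host` line are not shown here) and lists the sitemap URLs. Note that `urllib.robotparser` does not interpret `*` wildcards in paths, so it is used here only for the sitemap records.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction based on the groups and records listed in this report.
# Assumption: the real file may differ in order, spacing, and extra fields.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow: */search?query=*
Disallow: *?p=*
Disallow: *?s=*

Sitemap: https://planetapolska.com/sitemap.xml
Sitemap: https://planetapolska.com/feed/googlenews/googlenews.xml
"""


def list_sitemaps(robots_txt: str) -> list[str]:
    """Return the Sitemap URLs declared in a robots.txt text (Python 3.8+)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.site_maps() or []


if __name__ == "__main__":
    for sitemap_url in list_sitemaps(RECONSTRUCTED_ROBOTS_TXT):
        print(sitemap_url)
```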

Warnings

  • `host` is not a known field.
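
A warning of this kind can be reproduced with a simple field check. The sketch below flags any field name outside a small allow-list; the list is an assumption (the scanner's actual set of known fields is not stated in the report), and the `Host` value in the sample is a placeholder, since the report does not show the real one.

```python
# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognized field names in order of first appearance."""
    seen: list[str] = []
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


if __name__ == "__main__":
    sample = (
        "User-agent: *\n"
        "Disallow: *?p=*\n"
        "Host: example.com\n"  # placeholder value, not taken from the report
        "Sitemap: https://planetapolska.com/sitemap.xml\n"
    )
    for field in unknown_fields(sample):
        print(f"`{field}` is not a known field.")
```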