krzysztofbrzozowski.com
robots.txt

Robots Exclusion Standard data for krzysztofbrzozowski.com

Resource Scan

Scan Details

Site Domain krzysztofbrzozowski.com
Base Domain krzysztofbrzozowski.com
Scan Status Ok
Last Scan2025-05-16T08:35:16+00:00
Next Scan 2025-06-15T08:35:16+00:00

Last Scan

Scanned2025-05-16T08:35:16+00:00
URL https://krzysztofbrzozowski.com/robots.txt
Domain IPs 104.21.66.76, 172.67.157.128, 2606:4700:3035::ac43:9d80, 2606:4700:3037::6815:424c
Response IP 104.21.66.76
Found Yes
Hash c23f4b4c3c737efddfaa3e5ecec764e4666eb1bfbf7548faac40a63730e23c11
SimHash 4d768c6acdd3

Groups

*

Rule Path
Disallow /search/*
Disallow /search/
Disallow /*.pdf
Disallow /*.ppt
Disallow /*.doc
Disallow /*.xls
Disallow /*.txt
Disallow /515*

Other Records

Field Value
sitemap https://krzysztofbrzozowski.com/sitemap.xml

Warnings

  • `host` is not a known field.