dariusz.wieckiewicz.org
robots.txt
Robots Exclusion Standard data for dariusz.wieckiewicz.org
Resource Scan
Scan Details
Site Domain | dariusz.wieckiewicz.org |
Base Domain | wieckiewicz.org |
Scan Status | Ok |
Last Scan | 2024-10-25T17:23:55+00:00 |
Next Scan | 2024-11-24T17:23:55+00:00 |
Last Scan
Scanned | 2024-10-25T17:23:55+00:00 |
URL | https://dariusz.wieckiewicz.org/robots.txt |
Domain IPs | 18.139.194.139, 2406:da18:880:3801::c8, 2406:da18:b3d:e202::64, 46.137.195.11 |
Response IP | 13.251.96.10 |
Found | Yes |
Hash | 22ab3fab64548d7badbff98e400171d3e7684ef1887de4c6eac882f74c447e7e |
SimHash | 7014cb11c203 |
Groups
*
Rule | Path |
---|---|
Disallow | /pobierz/* |
ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
friendlycrawler
gptbot
icc-crawler
imagesiftbot
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
img2dataset
omgili
omgilibot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://dariusz.wieckiewicz.org/sitemap.xml |
sitemap | https://dariusz.wieckiewicz.org/pl/sitemap.xml |
sitemap | https://dariusz.wieckiewicz.org/en/sitemap.xml |
sitemap | https://dariusz.wieckiewicz.org/pl/imagessitemap.xml |
sitemap | https://dariusz.wieckiewicz.org/en/imagessitemap.xml |