apek.cz
robots.txt
Robots Exclusion Standard data for apek.cz
Resource Scan
Scan Details
Site Domain | apek.cz |
Base Domain | apek.cz |
Scan Status | Ok |
Last Scan | 2024-11-16T19:21:08+00:00 |
Next Scan | 2024-12-16T19:21:08+00:00 |
Last Scan
Scanned | 2024-11-16T19:21:08+00:00 |
URL | https://www.apek.cz/robots.txt |
Domain IPs | 46.28.111.98 |
Response IP | 46.28.111.98 |
Found | Yes |
Hash | 8488a024128b12b21cf394b1975470fea796ea2306ec5ca65da956f50642d18e |
SimHash | 7307637301a7 |
Groups
*
Rule | Path |
---|---|
Disallow | /cms/ |
Disallow | /articles/read/ |
Disallow | /competitions/read/ |
Disallow | /jobs/read/ |
Disallow | /calendars/read/ |
Disallow | /redirect/ |
Disallow | /index.php/redirect/ |
Disallow | /bannerove-kampane |
ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
isscyberriskcrawler
imagesiftbot
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
iaskspider/2.0
img2dataset
omgili
omgilibot
Rule | Path |
---|---|
Disallow | / |
Warnings
- 10 invalid lines.