Robots Exclusion Standard data for iglanc.cz

Resource Scan

Scan Details

Site Domain iglanc.cz
Base Domain iglanc.cz
Scan Status Ok
Last Scan 2024-11-14T06:25:53+00:00
Next Scan 2024-11-21T06:25:53+00:00

Last Scan

Scanned 2024-11-14T06:25:53+00:00
URL https://iglanc.cz/robots.txt
Redirect https://www.iglanc.cz/robots.txt
Redirect Domain www.iglanc.cz
Redirect Base iglanc.cz
Domain IPs 104.21.20.128, 172.67.192.230, 2606:4700:3030::6815:1480, 2606:4700:3035::ac43:c0e6
Redirect IPs 104.21.20.128, 172.67.192.230, 2606:4700:3030::6815:1480, 2606:4700:3035::ac43:c0e6
Response IP 104.21.20.128
Found Yes
Hash 233d2aec2a4db30cd3a0febf2f9747351483287b1a6193f69f0ad10029384b37
SimHash 80157064ee93

Groups

ia_archiver

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.iglanc.cz/sitemap.xml
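To illustrate how a crawler would read the groups and the sitemap record listed above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text in it is reconstructed from this report, so the live file's exact formatting, ordering and comments are assumptions; "SomeOtherBot" and the helper name build_parser are hypothetical and used only for illustration. Note that the empty Disallow in the grapeshot group blocks nothing, and that the report lists no wildcard (*) group, so agents not named here fall through to the default of being allowed.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of the scan.
# Assumption: the real file may differ in layout, order and comments.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: grapeshot
Disallow:

User-agent: machinelearning
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: anthropic-ai
Disallow: /

Sitemap: https://www.iglanc.cz/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
page = "https://www.iglanc.cz/"

# "SomeOtherBot" is a hypothetical agent that matches no group.
for agent in ("ia_archiver", "grapeshot", "GPTBot", "anthropic-ai", "SomeOtherBot"):
    verdict = "allowed" if parser.can_fetch(agent, page) else "blocked"
    print(f"{agent}: {verdict}")

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Matching of agent names in this parser is case-insensitive, which is why "GPTBot" is caught by the gptbot group.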