iharstad.no
robots.txt

Robots Exclusion Standard data for iharstad.no

Resource Scan

Scan Details

Site Domain iharstad.no
Base Domain iharstad.no
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-06T07:02:48+00:00
Next Scan 2025-11-05T07:02:48+00:00

Last Successful Scan

Scanned2025-09-25T00:31:19+00:00
URL https://iharstad.no/robots.txt
Redirect https://www.iharstad.no/robots.txt
Redirect Domain www.iharstad.no
Redirect Base iharstad.no
Domain IPs 2a02:c0:ac::e51:1, 87.238.38.1, 87.238.38.2
Redirect IPs 104.18.22.107, 104.18.23.107, 2606:4700::6812:166b, 2606:4700::6812:176b
Response IP 104.18.23.107
Found Yes
Hash 2697da9d27c06925dc7a8ec862aaf9fdf76106fe79587035d08e53e050d2bafc
SimHash 500cc8c0e413

Groups

*

Rule Path
Allow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Comments

  • Start AI crawler block
  • End AI crawler block