aalborgnu.dk
robots.txt

Robots Exclusion Standard data for aalborgnu.dk

Resource Scan

Scan Details

Site Domain aalborgnu.dk
Base Domain aalborgnu.dk
Scan Status Ok
Last Scan2024-11-02T10:43:15+00:00
Next Scan 2024-11-09T10:43:15+00:00

Last Scan

Scanned2024-11-02T10:43:15+00:00
URL https://aalborgnu.dk/robots.txt
Redirect https://ligeher.nu/robots.txt
Redirect Domain ligeher.nu
Redirect Base ligeher.nu
Domain IPs 194.88.217.47
Redirect IPs 52.233.184.181
Response IP 52.233.184.181
Found Yes
Hash 985fa24f5309db528a4b872a3b260fd90ce0d2b134ded98ca7f8894c5e542851
SimHash 30329a3087e7

Groups

*

Rule Path
Allow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ligeher.nu/sitemap.xml

Comments

  • AI crawler reference
  • The link below provides instructions to what kind of content can be used to train AI models on this website
  • https://ligeher.nu/ai.txt
  • Search engines
  • Common crawl
  • OpenAI (ChatGPT)
  • OpenAI (ChatGPT realtime search)
  • Anthropic
  • Sitemap