din-erhvervsguide.dk
robots.txt

Robots Exclusion Standard data for din-erhvervsguide.dk

Resource Scan

Scan Details

Site Domain din-erhvervsguide.dk
Base Domain din-erhvervsguide.dk
Scan Status Ok
Last Scan 2024-11-13T22:10:14+00:00
Next Scan 2024-11-20T22:10:14+00:00

Last Scan

Scanned 2024-11-13T22:10:14+00:00
URL https://din-erhvervsguide.dk/robots.txt
Redirect https://www.herningfolkeblad.dk/robots.txt
Redirect Domain www.herningfolkeblad.dk
Redirect Base herningfolkeblad.dk
Domain IPs 104.21.27.171, 172.67.169.150, 2606:4700:3034::6815:1bab, 2606:4700:3035::ac43:a996
Redirect IPs 13.50.30.140, 13.51.18.34, 16.170.142.181
Response IP 13.50.30.140
Found Yes
Hash 0d406ebf2dda328328ea69111a4ab3cd7b47fa8411a2e409e94853b7c27de8e8
SimHash 5a345840a913

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /
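The groups above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it assumes the groups correspond to a robots.txt laid out as in the ROBOTS_TXT string below (reconstructed from the listing, not copied from the live file), and it uses Python's standard urllib.robotparser. The helper name is_allowed and the agent name "examplebot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule groups listed above (an assumption:
# the live file may differ in ordering, casing, or extra directives).
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: anthropic-ai
Disallow: /
"""

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under these rules?"""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent in ("examplebot", "ccbot", "gptbot", "chatgpt-user", "anthropic-ai"):
        for path in ("/", "/wp-admin/"):
            print(f"{agent:13} {path:11} allowed={is_allowed(agent, path)}")
```

Under these assumed rules, a crawler that matches only the wildcard group is kept out of /wp-admin/ and /wp-includes/ but may fetch everything else, while the four named agents are blocked from the whole site. The standard-library parser matches user-agent names case-insensitively, so "GPTBot" and "gptbot" are treated the same.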