camdennewjournal.com
robots.txt

Robots Exclusion Standard data for camdennewjournal.com

Resource Scan

Scan Details

Site Domain camdennewjournal.com
Base Domain camdennewjournal.com
Scan Status Ok
Last Scan2024-11-13T17:12:29+00:00
Next Scan 2024-11-20T17:12:29+00:00

Last Scan

Scanned2024-11-13T17:12:29+00:00
URL https://camdennewjournal.com/robots.txt
Redirect https://www.camdennewjournal.co.uk/robots.txt
Redirect Domain www.camdennewjournal.co.uk
Redirect Base camdennewjournal.co.uk
Domain IPs 109.228.46.244
Redirect IPs 109.228.46.244
Response IP 109.228.46.244
Found Yes
Hash 09b9281d9377922fb9dbfe515948d5df7fb13236739b84adc0451de564308045
SimHash 18105105f920

Groups

*

Rule Path
Allow /

grapeshot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /