guernseygazette.com
robots.txt

Robots Exclusion Standard data for guernseygazette.com

Resource Scan

Scan Details

Site Domain guernseygazette.com
Base Domain guernseygazette.com
Scan Status Ok
Last Scan 2024-11-12T21:27:30+00:00
Next Scan 2024-11-19T21:27:30+00:00

Last Scan

Scanned 2024-11-12T21:27:30+00:00
URL https://guernseygazette.com/robots.txt
Redirect https://www.guernseygazette.com/robots.txt
Redirect Domain www.guernseygazette.com
Redirect Base guernseygazette.com
Domain IPs 65.61.154.7
Redirect IPs 65.61.154.7
Response IP 65.61.154.7
Found Yes
Hash 7161d2dcaa7ac87366d78a3bb9f4670904ef53841220d1912b6f2e14859b651c
SimHash 84f31986e9d3

Groups

*

Rule Path
Disallow /css/
Disallow /css_system/
Disallow /js/
Disallow /js_system/
Disallow /account/
Disallow /calendar/post/
Disallow /forms/
Disallow /login.html
Disallow /poll_process.html
Disallow /post_comments.html
Disallow /my_profile.html
Disallow /my_stuff.html
Disallow /user_profile.html
Disallow /ajax/
Disallow /register.html
Disallow /report_item.html
Disallow /send_item.html
Disallow /subscribe/
Disallow /renew/
Disallow /account/
Disallow /resetpassword/
Disallow /reset/
Disallow /alacarte/
Disallow /lookup/
Disallow /register-local/
Disallow /entercode/
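
To illustrate how a crawler might apply the rules in this group, here is a minimal sketch using Python's standard urllib.robotparser. It is not the scanner's own method. The robots.txt text is reassembled from a subset of the rules listed above (the raw file is not shown in this report, so the exact "User-agent: *" line is an assumption), and the agent name "ExampleBot" and the path /news/ are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a robots.txt body rebuilt from a subset of the Disallow
# rules in the "*" group above; the real file may differ in layout.
ROBOTS_TXT = """\
User-agent: *
Disallow: /css/
Disallow: /account/
Disallow: /login.html
Disallow: /subscribe/
"""

# Host taken from the Redirect URL recorded in the scan.
BASE = "https://www.guernseygazette.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


PARSER = build_parser(ROBOTS_TXT)


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` (hypothetical name) may fetch `path`."""
    return PARSER.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # /news/ is a hypothetical path used only for illustration.
    for path in ("/news/", "/account/settings", "/login.html"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

The standard-library parser matches each rule as a path prefix, so /account/ blocks everything beneath that directory, while /login.html blocks only paths that start with that exact string.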

Other Records

Field Value
sitemap https://www.guernseygazette.com/sitemaps/sitemaps-r2-default-guernsey-1.xml
sitemap https://www.guernseygazette.com/sitemaps/sitemaps-r2-googlenews-guernsey-1.xml
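
The sitemap records above can be read back out of a robots.txt body with the same standard-library parser. The sketch below assumes Python 3.8 or later, where RobotFileParser.site_maps() is available. The robots.txt text is reconstructed from the records in this report rather than copied from the live file, and list_sitemaps is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumption: robots.txt text rebuilt from one rule and the two sitemap
# records shown in this report; the real file contains more rules.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ajax/
Sitemap: https://www.guernseygazette.com/sitemaps/sitemaps-r2-default-guernsey-1.xml
Sitemap: https://www.guernseygazette.com/sitemaps/sitemaps-r2-googlenews-guernsey-1.xml
"""


def list_sitemaps(text: str) -> list[str]:
    """Return the Sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in list_sitemaps(ROBOTS_TXT):
        print(url)
```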