essexcountystandard.co.uk
robots.txt

Robots Exclusion Standard data for essexcountystandard.co.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	essexcountystandard.co.uk
Base Domain	essexcountystandard.co.uk
Scan Status	Ok
Last Scan	2024-11-01T23:23:33+00:00
Next Scan	2024-11-08T23:23:33+00:00

Last Scan

Scanned	2024-11-01T23:23:33+00:00
URL	https://essexcountystandard.co.uk/robots.txt
Domain IPs	93.174.10.103
Response IP	93.174.10.103
Found	Yes
Hash	ff2c75d08b26f6490aab44e925b3900cf57d5bba3b869f04d1422f0e7e9aba29
SimHash	5b649212ed12

Groups

*

Rule	Path
Disallow	/__siren/
Disallow	/resources/images/captcha2*
Disallow	/search/*
Disallow	/soldpricesearch/*
Disallow	/announcements/public_notices/download/*

Rule

Path

Disallow

/__siren/

Disallow

/resources/images/captcha2*

Disallow

/search/*

Disallow

/soldpricesearch/*

Disallow

/announcements/public_notices/download/*

googlebot-news

Rule	Path
Disallow	/advertise/
Disallow	/advertising/
Disallow	/li/
Disallow	/mart/
Disallow	/trade_directory/
Disallow	/dating/
Disallow	/homes/
Disallow	/jobs/

Rule

Path

Disallow

/advertise/

Disallow

/advertising/

Disallow

/li/

Disallow

/mart/

Disallow

/trade_directory/

Disallow

/dating/

Disallow

/homes/

Disallow

/jobs/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.gazette-news.co.uk/sitemap.xml

Field

Value

sitemap

https://www.gazette-news.co.uk/sitemap.xml

Back to top

essexcountystandard.co.ukrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-news

gptbot

perplexitybot

Other Records

essexcountystandard.co.uk
robots.txt