campaignlive.co.uk
robots.txt

Robots Exclusion Standard data for campaignlive.co.uk

Resource Scan

Scan Details

Site Domain campaignlive.co.uk
Base Domain campaignlive.co.uk
Scan Status Ok
Last Scan2024-11-08T21:51:35+00:00
Next Scan 2024-12-08T21:51:35+00:00

Last Scan

Scanned2024-11-08T21:51:35+00:00
URL https://campaignlive.co.uk/robots.txt
Redirect https://www.campaignlive.co.uk/robots.txt
Redirect Domain www.campaignlive.co.uk
Redirect Base campaignlive.co.uk
Domain IPs 104.26.14.188, 104.26.15.188, 172.67.74.27, 2606:4700:20::681a:ebc, 2606:4700:20::681a:fbc, 2606:4700:20::ac43:4a1b
Redirect IPs 104.26.14.188, 104.26.15.188, 172.67.74.27, 2606:4700:20::681a:ebc, 2606:4700:20::681a:fbc, 2606:4700:20::ac43:4a1b
Response IP 104.26.15.188
Found Yes
Hash 0415f8bc9f9455f2ee8520e923d026ab297796ad0e35df815b523ed8f205fe79
SimHash 8a0618028581

Groups

*

Rule Path
Disallow /search/
Disallow /login?
Disallow /rulesforcommenting/
Disallow /PAGE/*
Disallow /page/*
Disallow /register/?

gptbot

Rule Path
Disallow /
Disallow /jobs/session-img/
Disallow /jobs/invalid-request/
Disallow /jobs/document/
Disallow /jobs/apply-profile/
Disallow /jobs/emailjob/
Disallow /jobs/logon/
Disallow /jobs/register/
Disallow /jobs/searchjobs/
Disallow /jobs/*/*/*/*/*/*/
Disallow /ads-test-page-1
Disallow /ads-test-page-2
Disallow /ads-test-page-3

googlebot-news

Rule Path
Disallow /jobs/

Other Records

Field Value
sitemap https://www.campaignlive.co.uk/newsmap.xml
sitemap https://www.campaignlive.co.uk/sitemap.xml
sitemap https://www.campaignlive.co.uk/jobs/sitemapindex.xml

Comments

  • Editorial Site
  • Job Site
  • PageCreator pages