brandrepublic.com
robots.txt

Robots Exclusion Standard data for brandrepublic.com

Resource Scan

Scan Details

Site Domain brandrepublic.com
Base Domain brandrepublic.com
Scan Status Ok
Last Scan 2024-06-10T11:59:57+00:00
Next Scan 2024-06-17T11:59:57+00:00

Last Scan

Scanned 2024-06-10T11:59:57+00:00
URL https://brandrepublic.com/robots.txt
Redirect https://www.campaignlive.co.uk/robots.txt
Redirect Domain www.campaignlive.co.uk
Redirect Base campaignlive.co.uk
Domain IPs 104.21.54.171, 172.67.140.176, 2606:4700:3033::ac43:8cb0, 2606:4700:3034::6815:36ab
Redirect IPs 104.21.1.237, 172.67.152.149, 2606:4700:3035::6815:1ed, 2606:4700:3037::ac43:9895
Response IP 104.21.1.237
Found Yes
Hash 0415f8bc9f9455f2ee8520e923d026ab297796ad0e35df815b523ed8f205fe79
SimHash 8a0618028581

Groups

*

Rule Path
Disallow /search/
Disallow /login?
Disallow /rulesforcommenting/
Disallow /PAGE/*
Disallow /page/*
Disallow /register/?

gptbot

Rule Path
Disallow /
Disallow /jobs/session-img/
Disallow /jobs/invalid-request/
Disallow /jobs/document/
Disallow /jobs/apply-profile/
Disallow /jobs/emailjob/
Disallow /jobs/logon/
Disallow /jobs/register/
Disallow /jobs/searchjobs/
Disallow /jobs/*/*/*/*/*/*/
Disallow /ads-test-page-1
Disallow /ads-test-page-2
Disallow /ads-test-page-3

googlebot-news

Rule Path
Disallow /jobs/
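
The groups above can be checked against sample URLs with Python's standard-library urllib.robotparser. This is a minimal sketch, not the scanner's own method. The robots.txt text is reconstructed from this report, so the real file's layout may differ. The gptbot group is shortened to its "Disallow: /" rule, which already covers every path. The agent name ExampleBot and the sample paths are hypothetical. The wildcard and "?" rules are left untested because the standard-library parser does simple prefix matching and is not known to implement "*" wildcards.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (assumed layout).
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/
Disallow: /login?
Disallow: /rulesforcommenting/
Disallow: /PAGE/*
Disallow: /page/*
Disallow: /register/?

User-agent: gptbot
Disallow: /

User-agent: googlebot-news
Disallow: /jobs/
"""

BASE = "https://www.campaignlive.co.uk"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# (user agent, path) pairs; the paths are hypothetical examples.
checks = [
    ("GPTBot", "/article/example"),     # gptbot group blocks everything
    ("Googlebot-News", "/jobs/"),       # blocked by its own group
    ("Googlebot-News", "/search/"),     # its own group replaces the * group
    ("ExampleBot", "/search/results"),  # falls back to the * group
    ("ExampleBot", "/news"),            # no matching rule
]

results = {(agent, path): parser.can_fetch(agent, BASE + path)
           for agent, path in checks}

for (agent, path), allowed in results.items():
    print(f"{agent:15} {path:20} {'allowed' if allowed else 'blocked'}")
```

A crawler that matches a named group uses only that group, so googlebot-news is bound by "Disallow /jobs/" and not by the rules under "*".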

Other Records

Field Value
sitemap https://www.campaignlive.co.uk/newsmap.xml
sitemap https://www.campaignlive.co.uk/sitemap.xml
sitemap https://www.campaignlive.co.uk/jobs/sitemapindex.xml
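
The sitemap records above apply to all crawlers, independent of any user-agent group. As a small sketch, assuming the records appear in the file as standard "Sitemap:" lines, they can be read back with urllib.robotparser (site_maps() requires Python 3.8 or later):

```python
from urllib.robotparser import RobotFileParser

# Sitemap records from this report, written as robots.txt lines (assumed form).
SITEMAP_LINES = [
    "Sitemap: https://www.campaignlive.co.uk/newsmap.xml",
    "Sitemap: https://www.campaignlive.co.uk/sitemap.xml",
    "Sitemap: https://www.campaignlive.co.uk/jobs/sitemapindex.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records are present.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```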

Comments

  • Editorial Site
  • Job Site
  • PageCreator pages