thegenepool.co.uk
robots.txt

Robots Exclusion Standard data for thegenepool.co.uk

Resource Scan

Scan Details

Site Domain thegenepool.co.uk
Base Domain thegenepool.co.uk
Scan Status Ok
Last Scan2024-06-26T02:30:24+00:00
Next Scan 2024-07-03T02:30:24+00:00

Last Scan

Scanned2024-06-26T02:30:24+00:00
URL https://www.thegenepool.co.uk/robots.txt
Domain IPs 104.21.39.143, 172.67.146.29, 2606:4700:3030::6815:278f, 2606:4700:3030::ac43:921d
Response IP 104.21.39.143
Found Yes
Hash a544d626452a9f7be088e760fca7a131888b14ac12ff895341b439315a973f31
SimHash c8568e3169d0

Groups

*

Rule Path
Disallow /admin/*

domaincrawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

coccocbot-image

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

centurybot9

Rule Path
Disallow /

webdatastats

Rule Path
Disallow /

re-re studio

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

scrapy/1.8.0

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

lcc

Rule Path
Disallow /

goodbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thegenepool.co.uk/googlesitemap.asp

Comments

  • If the site is on a preview address (eg. test12.uc4.co.uk) and you don't want it to be crawled by Google, Bing etc,
  • uncomment the user-agent and disallow lines otherwise make sure you REMOVE them for a live site that is going to be submitted.
  • User-agent: DomainCrawler
  • Disallow: /
  • block robots