keighleyrufc.com
robots.txt

Robots Exclusion Standard data for keighleyrufc.com

Resource Scan

Scan Details

Site Domain	keighleyrufc.com
Base Domain	keighleyrufc.com
Scan Status	Ok
Last Scan	2024-11-13T22:03:05+00:00
Next Scan	2024-11-20T22:03:05+00:00

Last Scan

Scanned	2024-11-13T22:03:05+00:00
URL	https://www.keighleyrufc.com/robots.txt
Domain IPs	108.156.133.19, 108.156.133.25, 108.156.133.61, 108.156.133.69
Response IP	108.156.133.19
Found	Yes
Hash	1105d3093daae91e02e57b90a2a17902006d30a1d01eb592d3630ee58132c5a3
SimHash	090a9570d3a1
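
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is a SHA-256 digest of the raw robots.txt response body, a change check between scans could be sketched as follows. The function name `body_changed` is hypothetical and not part of the scan tool.

```python
import hashlib

# Hash value copied from the scan report. Treating it as SHA-256 of the
# response body is an assumption; the report does not state the algorithm.
REPORTED_HASH = "1105d3093daae91e02e57b90a2a17902006d30a1d01eb592d3630ee58132c5a3"


def body_changed(body: bytes, previous_hash: str = REPORTED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` differs from `previous_hash`."""
    return hashlib.sha256(body).hexdigest() != previous_hash
```

A scanner could call `body_changed` on each newly fetched body and re-parse the rules only when it returns True.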

Groups

*

Rule	Path
Disallow	/webmaster/
Disallow	/proclubadmin/
Disallow	/divisionadmin/
Disallow	/division-admin/
Disallow	/competitionadmin/
Disallow	/oscar/
Disallow	/v5clubs/
Disallow	/_subdomains/
Disallow	/_services/
Disallow	/ct/
Disallow	/sports/activity-feed

Other Records

Field	Value
crawl-delay	5
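
As an illustration of how a crawler might apply the `*` group, the sketch below rebuilds that group from the table and checks it with Python's standard `urllib.robotparser`. The directive order inside the group and the user agent name `ExampleBot` are assumptions; the report lists the rules but not the original file layout.

```python
from urllib.robotparser import RobotFileParser

# The "*" group reconstructed from the scan report.
# The order of directives is an assumption; the report only lists them.
STAR_GROUP = """\
User-agent: *
Crawl-delay: 5
Disallow: /webmaster/
Disallow: /proclubadmin/
Disallow: /divisionadmin/
Disallow: /division-admin/
Disallow: /competitionadmin/
Disallow: /oscar/
Disallow: /v5clubs/
Disallow: /_subdomains/
Disallow: /_services/
Disallow: /ct/
Disallow: /sports/activity-feed
"""

BASE = "https://www.keighleyrufc.com"


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(STAR_GROUP)

# "ExampleBot" is a hypothetical crawler with no group of its own,
# so it falls back to the "*" group.
print(rules.can_fetch("ExampleBot", BASE + "/webmaster/"))  # False
print(rules.can_fetch("ExampleBot", BASE + "/"))            # True
print(rules.crawl_delay("ExampleBot"))                      # 5
```

Disallow paths are matched as prefixes, so `/sports/activity-feed` also covers any URL whose path begins with that string.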

mediapartners-google

Rule	Path
Allow	/

magpie-crawler

Rule	Path
Disallow	/

seokicks-robot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

mauibot

Rule	Path
Disallow	/

bl.uk_lddc_bot

Rule	Path
Disallow	/

bl.uk_ldfc_bot

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

seekportbot

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

barkrowler

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

bytedance

Rule	Path
Disallow	/

Comments

These bots did not respect the Crawl-delay directive and so have been disallowed
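
The groups listed above can be exercised together in a short sketch. It rebuilds a reduced robots.txt from the report (only one `*` rule is kept for brevity) and checks several crawler names with `urllib.robotparser`. The file layout, the helper name `build_robots_txt`, and the crawler `ExampleBot` are assumptions for illustration. The report does not say which of the blocked groups the comment refers to, so every group with `Disallow: /` is included.

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with "Disallow: /".
BLOCKED_BOTS = [
    "magpie-crawler", "seokicks-robot", "mj12bot", "mauibot",
    "bl.uk_lddc_bot", "bl.uk_ldfc_bot", "ahrefsbot", "dotbot",
    "semrushbot", "seekportbot", "petalbot", "barkrowler",
    "bytespider", "bytedance",
]


def build_robots_txt() -> str:
    """Assemble a reduced robots.txt; the layout is an assumption."""
    lines = ["User-agent: *", "Crawl-delay: 5", "Disallow: /webmaster/", ""]
    lines += ["User-agent: mediapartners-google", "Allow: /", ""]
    for bot in BLOCKED_BOTS:
        lines += [f"User-agent: {bot}", "Disallow: /", ""]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

HOME = "https://www.keighleyrufc.com/"

# A blocked crawler may not fetch anything, including the home page.
print(parser.can_fetch("AhrefsBot", HOME))                            # False
# mediapartners-google has "Allow: /", which takes the place of the "*" rules.
print(parser.can_fetch("Mediapartners-Google", HOME + "webmaster/"))  # True
# Any other crawler (hypothetical "ExampleBot") falls back to the "*" group.
print(parser.can_fetch("ExampleBot", HOME))                           # True
print(parser.can_fetch("ExampleBot", HOME + "webmaster/"))            # False
```

Python's parser matches user agent names case-insensitively, so `AhrefsBot` matches the `ahrefsbot` group.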
