newarkrugby.com
robots.txt

Robots Exclusion Standard data for newarkrugby.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	newarkrugby.com
Base Domain	newarkrugby.com
Scan Status	Ok
Last Scan	2024-11-11T10:36:28+00:00
Next Scan	2024-11-18T10:36:28+00:00

Last Scan

Scanned	2024-11-11T10:36:28+00:00
URL	https://newarkrugby.com/robots.txt
Redirect	https://www.newarkrugby.com/robots.txt
Redirect Domain	www.newarkrugby.com
Redirect Base	newarkrugby.com
Domain IPs	52.223.32.97
Redirect IPs	13.225.4.102, 13.225.4.108, 13.225.4.25, 13.225.4.73
Response IP	13.225.4.108
Found	Yes
Hash	1105d3093daae91e02e57b90a2a17902006d30a1d01eb592d3630ee58132c5a3
SimHash	090a9570d3a1

Groups

*

Rule	Path
Disallow	/webmaster/
Disallow	/proclubadmin/
Disallow	/divisionadmin/
Disallow	/division-admin/
Disallow	/competitionadmin/
Disallow	/oscar/
Disallow	/v5clubs/
Disallow	/_subdomains/
Disallow	/_services/
Disallow	/ct/
Disallow	/sports/activity-feed

Rule

Path

Disallow

/webmaster/

Disallow

/proclubadmin/

Disallow

/divisionadmin/

Disallow

/division-admin/

Disallow

/competitionadmin/

Disallow

/oscar/

Disallow

/v5clubs/

Disallow

/_subdomains/

Disallow

/_services/

Disallow

/ct/

Disallow

/sports/activity-feed

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks-robot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

mauibot

Rule	Path
Disallow	/

Rule

Path

Disallow

bl.uk_lddc_bot

Rule	Path
Disallow	/

Rule

Path

Disallow

bl.uk_ldfc_bot

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seekportbot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

barkrowler

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

bytedance

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

These bots did not respect the Crawl-delay directive and so have been disallowed

newarkrugby.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

mediapartners-google

magpie-crawler

seokicks-robot

mj12bot

mauibot

bl.uk_lddc_bot

bl.uk_ldfc_bot

ahrefsbot

dotbot

semrushbot

seekportbot

petalbot

barkrowler

bytespider

bytedance

Comments

newarkrugby.com
robots.txt