archiveuk.biz
robots.txt

Robots Exclusion Standard data for archiveuk.biz

Resource Scan

Scan Details

Site Domain	archiveuk.biz
Base Domain	archiveuk.biz
Scan Status	Ok
Last Scan	5/28/2025, 4:28:20 PM
Next Scan	6/4/2025, 4:28:20 PM

Last Scan

Scanned	5/28/2025, 4:28:20 PM
URL	https://archiveuk.biz/robots.txt
Domain IPs	104.21.49.32, 172.67.140.132, 2606:4700:3036::ac43:8c84, 2606:4700:3037::6815:3120
Response IP	172.67.140.132
Found	Yes
Hash	d7f4614ae3971e3f9ec35877ff44ea5798a0b372c6725d13e04301e42b6a7ed4
SimHash	501d0142f560
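
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 of the raw response body is an assumption. Under that assumption, a change check between two scans could be sketched as follows (`body_digest` and `has_changed` are hypothetical helper names):

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Hex digest of a robots.txt body (assumes the scanner uses SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hex: str, body: bytes) -> bool:
    """True if the body no longer matches the digest stored by an earlier scan."""
    return body_digest(body) != previous_hex.lower()
```

An exact digest only shows whether the file changed at all. The separate SimHash field is presumably there to measure near-duplicate similarity, which this sketch does not attempt.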

Groups

daumoa

Rule	Path
Disallow	/

cliqzbot

Rule	Path
Disallow	/

crawler4j

Rule	Path
Disallow	/

getintent

Rule	Path
Disallow	/

coccoc

Rule	Path
Disallow	/

proximic

Rule	Path
Disallow	/

grapeshot

Rule	Path
Disallow	/

ltx71

Rule	Path
Disallow	/

jamesbot

Rule	Path
Disallow	/

smtbot

Rule	Path
Disallow	/

scrapy

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

imagesiftbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/

facebookexternalhit

Rule	Path
Disallow	/

serpstatbot

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

barkrowler

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/search?
Disallow	/edit/
Disallow	/cdn-cgi/
Disallow	/dynjs/
Disallow	/dyn/actions/
Disallow	/en/search?
Allow	/
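
To illustrate how the `*` group resolves for a given URL, here is a minimal sketch that applies longest-match precedence as described in RFC 9309, with Allow preferred on a tie. None of the rules above uses wildcards, so plain prefix matching is enough here; a full parser would also handle `*` and `$`. The names `STAR_GROUP` and `is_allowed` are hypothetical and not part of the scan output.

```python
# Rules copied from the "*" group in the scan above, in file order.
STAR_GROUP = [
    ("disallow", "/search?"),
    ("disallow", "/edit/"),
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/dynjs/"),
    ("disallow", "/dyn/actions/"),
    ("disallow", "/en/search?"),
    ("allow", "/"),
]


def is_allowed(path_and_query, rules=STAR_GROUP):
    """Return True if the longest matching rule for this path is an Allow."""
    best_len = -1
    allowed = True  # with no matching rule, the path may be fetched
    for kind, prefix in rules:
        if not path_and_query.startswith(prefix):
            continue
        # A longer prefix is more specific; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


print(is_allowed("/search?q=maps"))  # search results are blocked
print(is_allowed("/search"))         # no "?" present, so only "Allow /" matches
```

The trailing `?` on the two search rules matters: they block only URLs that carry a query string, while the bare `/search` path falls through to `Allow /`. Crawlers named in the earlier groups never reach these rules, since their own group disallows `/` entirely.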

Other Records

Field	Value
sitemap	https://archiveuk.biz/sitemaps/sitemap_index.xml
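
The sitemap record is independent of any user-agent group, so it can be read without evaluating the rules. A minimal sketch of pulling such records out of a robots.txt body (the function name `sitemap_urls` and the sample text are illustrative assumptions, not the scanner's code):

```python
def sitemap_urls(robots_txt: str) -> list[str]:
    """Collect the values of all Sitemap records in a robots.txt body."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then split the record at its first colon.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        # Field names are case-insensitive; skip records with an empty value.
        if sep and field.strip().lower() == "sitemap" and value.strip():
            urls.append(value.strip())
    return urls


sample = (
    "User-agent: *\n"
    "Allow: /\n"
    "Sitemap: https://archiveuk.biz/sitemaps/sitemap_index.xml\n"
)
print(sitemap_urls(sample))
```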
