instaresearch.co.uk
robots.txt

Robots Exclusion Standard data for instaresearch.co.uk

Resource Scan

Scan Details

Site Domain	instaresearch.co.uk
Base Domain	instaresearch.co.uk
Scan Status	Ok
Last Scan	2026-02-14T17:29:39+00:00
Next Scan	2026-03-16T17:29:39+00:00
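
The two timestamps above are exactly 30 days apart, which suggests a fixed rescan interval, although the report does not state one. A minimal Python sketch of that arithmetic, with the 30-day interval treated as an assumption inferred from the table:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2026-02-14T17:29:39+00:00")
next_scan = datetime.fromisoformat("2026-03-16T17:29:39+00:00")

# Assumption: the scanner re-checks on a fixed interval. The 30-day value is
# inferred from the two timestamps; it is not documented in the report.
RESCAN_INTERVAL = timedelta(days=30)

interval = next_scan - last_scan              # observed gap between scans
predicted_next = last_scan + RESCAN_INTERVAL  # next scan under the assumption
```

Under this assumption, `predicted_next` matches the Next Scan value in the table.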

Last Scan

Scanned	2026-02-14T17:29:39+00:00
URL	https://instaresearch.co.uk/robots.txt
Domain IPs	104.21.56.110, 172.67.184.166, 2606:4700:3036::ac43:b8a6, 2606:4700:3037::6815:386e
Response IP	172.67.184.166
Found	Yes
Hash	c0380b4c65cfac88b231b5ccd4f258dc009df066964b536321a4311d92bae69d
SimHash	6d34ed9647d1
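
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the response body. The report does not name the algorithm, so that is an assumption. Under it, change detection between scans could be sketched as follows (`body_fingerprint` and `has_changed` are hypothetical helper names, not part of the scanner):

```python
import hashlib

# Hash value copied from the Last Scan table.
RECORDED_HASH = "c0380b4c65cfac88b231b5ccd4f258dc009df066964b536321a4311d92bae69d"


def body_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (algorithm assumed)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """Return True when a freshly fetched body no longer matches the recorded hash."""
    return body_fingerprint(body) != recorded_hash.lower()
```

A caller would pass the raw bytes fetched from the URL listed above; any byte-level difference, including whitespace, produces a different digest.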

Groups

*

Rule	Path
Allow	/
Disallow	/admin/
Disallow	/our-samples/
Disallow	/wp/

ia_archiver-web.archive.org

Rule	Path
Disallow	/

ia_archiver

Rule	Path
Disallow	/

turnitinbot/2.1

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

surveybot

Rule	Path
Disallow	/

emailwolf

Rule	Path
Disallow	/

emailsiphon

Rule	Path
Disallow	/

emailcollector

Rule	Path
Disallow	/

aboutusbot

Rule	Path
Disallow	/

robtexbot

Rule	Path
Disallow	/
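
Taken together, the groups listed above can be evaluated with the longest-match rule described in RFC 9309: the most specific matching path wins, and Allow wins a tie. The sketch below is a simplified illustration, not the scanner's implementation. It assumes exact, case-insensitive matching on the group name with a fallback to `*`, ignores wildcards, and uses the hypothetical names `GROUPS` and `is_allowed`. Python's standard `urllib.robotparser` is deliberately not used, because it applies rules in file order rather than by longest match.

```python
# Rules transcribed from the Groups tables; the order of lines in the
# original file is not shown in the report.
BLOCKED = [("disallow", "/")]
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/admin/"),
        ("disallow", "/our-samples/"),
        ("disallow", "/wp/"),
    ],
    "ia_archiver-web.archive.org": BLOCKED,
    "ia_archiver": BLOCKED,
    "turnitinbot/2.1": BLOCKED,
    "turnitinbot": BLOCKED,
    "surveybot": BLOCKED,
    "emailwolf": BLOCKED,
    "emailsiphon": BLOCKED,
    "emailcollector": BLOCKED,
    "aboutusbot": BLOCKED,
    "robtexbot": BLOCKED,
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest-match evaluation; a path with no matching rule is allowed."""
    # Simplification: exact group-name match, otherwise the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_goes_to_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_goes_to_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

For example, `is_allowed("Googlebot", "/admin/login")` is `False` because `/admin/` is a longer match than `/`, while `is_allowed("Googlebot", "/wp")` is `True` because `/wp` does not start with `/wp/`.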

Other Records

Field	Value
sitemap	https://instaresearch.co.uk/sitemap/index.xml

Comments

Allow all crawlers access to certain pages.
Disallow Following crawlers access to site.

Warnings

`host` is not a known field.
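
A warning like the one above can arise when a line's field name falls outside the set a parser recognises. The sketch below illustrates that kind of check. The `KNOWN_FIELDS` set is an assumption about what the scanner accepts, `find_unknown_fields` is a hypothetical helper, and `SAMPLE` is an invented excerpt, since the report does not reproduce the actual file.

```python
# Assumption: fields treated as known. The scanner's real list is not given.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def find_unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognised field names, lower-cased, in order of first appearance."""
    unknown: list[str] = []
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and padding
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown


# Hypothetical excerpt for illustration only.
SAMPLE = """\
User-agent: *
Allow: /
Disallow: /admin/
Host: instaresearch.co.uk
Sitemap: https://instaresearch.co.uk/sitemap/index.xml
"""

warnings = [f"`{name}` is not a known field." for name in find_unknown_fields(SAMPLE)]
```

Splitting on the first colon only keeps URL values such as the sitemap address intact.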
