instaresearch.co.uk
robots.txt

Robots Exclusion Standard data for instaresearch.co.uk

Resource Scan

Scan Details

Site Domain instaresearch.co.uk
Base Domain instaresearch.co.uk
Scan Status Ok
Last Scan2026-02-14T17:29:39+00:00
Next Scan 2026-03-16T17:29:39+00:00

Last Scan

Scanned2026-02-14T17:29:39+00:00
URL https://instaresearch.co.uk/robots.txt
Domain IPs 104.21.56.110, 172.67.184.166, 2606:4700:3036::ac43:b8a6, 2606:4700:3037::6815:386e
Response IP 172.67.184.166
Found Yes
Hash c0380b4c65cfac88b231b5ccd4f258dc009df066964b536321a4311d92bae69d
SimHash 6d34ed9647d1

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /our-samples/
Disallow /wp/

ia_archiver-web.archive.org

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

turnitinbot/2.1

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

aboutusbot

Rule Path
Disallow /

robtexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://instaresearch.co.uk/sitemap/index.xml

Comments

  • Allow all crawlers access to certain pages.
  • Disallow Following crawlers access to site.

Warnings

  • `host` is not a known field.