roman.co.uk
robots.txt

Robots Exclusion Standard data for roman.co.uk

Resource Scan

Scan Details

Site Domain roman.co.uk
Base Domain roman.co.uk
Scan Status Ok
Last Scan2024-10-08T05:56:59+00:00
Next Scan 2024-11-07T05:56:59+00:00

Last Scan

Scanned2024-10-08T05:56:59+00:00
URL https://roman.co.uk/robots.txt
Redirect https://www.roman.co.uk/robots.txt
Redirect Domain www.roman.co.uk
Redirect Base roman.co.uk
Domain IPs 104.18.36.169, 172.64.151.87, 2606:4700:4400::6812:24a9, 2606:4700:4400::ac40:9757
Redirect IPs 104.18.36.169, 172.64.151.87, 2606:4700:4400::6812:24a9, 2606:4700:4400::ac40:9757
Response IP 172.64.151.87
Found Yes
Hash 2e2e27c20c9fa78b941e445376953b86e7b1765400dd80c3c27bf93fe47e0e7a
SimHash e10d73558713

Groups

*

Rule Path
Disallow /*?q=*
Disallow /search*
Disallow *?request_type*
Disallow /account*
Disallow /basket*
Disallow /checkout*
Disallow /admin*
Disallow /size-guide?nolayout*
Disallow /quickview/
Disallow /size-guide/roman
Disallow /size-guide/dusk
Disallow /size-guide/petite
Disallow /size-guide/curve
Disallow /pdp-delivery-and-returns

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterestbot

Rule Path
Disallow /*?*
Disallow /account*
Disallow /basket*
Disallow /checkout*
Disallow /admin*

Other Records

Field Value
sitemap https://www.roman.co.uk/sitemap.xml

Comments

  • AllBots
  • Search
  • BloomreachURLs
  • BlockSecureAreas
  • NoIndex
  • DiscoverSitemap
  • User Agents