thepencompany.com
robots.txt

Robots Exclusion Standard data for thepencompany.com

Resource Scan

Scan Details

Site Domain thepencompany.com
Base Domain thepencompany.com
Scan Status Ok
Last Scan2024-05-19T20:12:06+00:00
Next Scan 2024-06-18T20:12:06+00:00

Last Scan

Scanned2024-05-19T20:12:06+00:00
URL https://thepencompany.com/robots.txt
Redirect https://www.thepencompany.com/robots.txt
Redirect Domain www.thepencompany.com
Redirect Base thepencompany.com
Domain IPs 104.26.2.159, 104.26.3.159, 172.67.71.209, 2606:4700:20::681a:29f, 2606:4700:20::681a:39f, 2606:4700:20::ac43:47d1
Redirect IPs 104.26.2.159, 104.26.3.159, 172.67.71.209, 2606:4700:20::681a:29f, 2606:4700:20::681a:39f, 2606:4700:20::ac43:47d1
Response IP 104.26.3.159
Found Yes
Hash 84f74a9a204f680c7ca6af5e28e5c9d86effdef71059e844f899b58dbc903f30
SimHash 2479584c09d3

Groups

*

Rule Path
Disallow */search/*
Disallow */.well-known/*
Disallow */maintenance/*
Disallow */basket/*
Disallow */checkout/*

Other Records

Field Value
sitemap https://www.thepencompany.com/sitemap.xml