
Robots Exclusion Standard data for penguin.co.uk

Resource Scan

Scan Details

Site Domain penguin.co.uk
Base Domain penguin.co.uk
Scan Status Ok
Last Scan 2024-09-26T02:40:07+00:00
Next Scan 2024-10-26T02:40:07+00:00

Last Scan

Scanned 2024-09-26T02:40:07+00:00
URL https://penguin.co.uk/robots.txt
Redirect https://www.penguin.co.uk/robots.txt
Redirect Domain www.penguin.co.uk
Redirect Base penguin.co.uk
Domain IPs 141.193.213.30, 141.193.213.31
Redirect IPs 141.193.213.30, 141.193.213.31
Response IP 141.193.213.30
Found Yes
Hash 21dbea84961357c463189438c22e96f0f71c17414a06e602e699a969fd825a60
SimHash 7908e92c7eb3

Groups

*

Rule Path
Allow *
Disallow /api/*
Disallow /preview/*
Disallow /*.php$
Disallow /*?p=*&
Disallow /*search-results?q=*
Disallow /*%26q%3D*
Disallow /*search-results?imprint*
Disallow /*search-results.html?q=*
Disallow *wp-admin/*
Disallow *?resize=*
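
As a rough illustration of how a crawler might evaluate the rules above, the sketch below matches a URL path against the listed patterns. It assumes the common wildcard conventions (`*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data. Percent-encoding normalization is left out for brevity, so this is a simplification and not a complete matcher.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "*"),
    ("disallow", "/api/*"),
    ("disallow", "/preview/*"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?p=*&"),
    ("disallow", "/*search-results?q=*"),
    ("disallow", "/*%26q%3D*"),
    ("disallow", "/*search-results?imprint*"),
    ("disallow", "/*search-results.html?q=*"),
    ("disallow", "*wp-admin/*"),
    ("disallow", "*?resize=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumed precedence: the longest matching pattern wins, and Allow
    wins when an Allow and a Disallow pattern have equal length.
    A path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and is_allow
            if longer or tie_for_allow:
                best_len, allowed = len(pattern), is_allow
    return allowed


if __name__ == "__main__":
    for sample in ("/books/", "/api/v1/books", "/index.php",
                   "/search-results?q=penguin", "/wp-admin/options"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `Allow *` matches every path but is the shortest pattern, so any Disallow rule that also matches takes precedence.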

Other Records

Field Value
sitemap https://www.penguin.co.uk/wp-sitemap.xml

Comments

  • Directories
  • Paths (no clean URLs)