printerbase.co.uk
robots.txt

Robots Exclusion Standard data for printerbase.co.uk

Resource Scan

Scan Details

Site Domain printerbase.co.uk
Base Domain printerbase.co.uk
Scan Status Ok
Last Scan 2025-06-24T08:17:10+00:00
Next Scan 2025-07-24T08:17:10+00:00
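
The gap between the two timestamps above can be checked with a short calculation. This is only a sketch of the arithmetic: the report does not say whether the scanner schedules by a fixed number of days or by calendar month, and for these two dates both readings give the same result.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table above.
last_scan = datetime.fromisoformat("2025-06-24T08:17:10+00:00")
next_scan = datetime.fromisoformat("2025-07-24T08:17:10+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```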

Last Scan

Scanned 2025-06-24T08:17:10+00:00
URL https://printerbase.co.uk/robots.txt
Redirect https://www.printerbase.co.uk/robots.txt
Redirect Domain www.printerbase.co.uk
Redirect Base printerbase.co.uk
Domain IPs 104.26.12.118, 104.26.13.118, 172.67.75.9, 2606:4700:20::681a:c76, 2606:4700:20::681a:d76, 2606:4700:20::ac43:4b09
Redirect IPs 104.26.12.118, 104.26.13.118, 172.67.75.9, 2606:4700:20::681a:c76, 2606:4700:20::681a:d76, 2606:4700:20::ac43:4b09
Response IP 104.26.12.118
Found Yes
Hash 91322467db714f74c6e7b5fd06f71bdb78f09cc38f68d32792dfbadeffc89df9
SimHash a5269f4ae33b

Groups

imagesiftbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pkginfo/
Disallow /report/
Disallow /setup/
Disallow /update/
Disallow /var/
Disallow /vendor/
Disallow /compare/
Disallow /search/
Disallow /mwdownloads/download/link/
Disallow /*my-account
Disallow /*catalog/category/view/
Disallow /*catalog/product/view/
Disallow /*SID%3D%E2%80%9D
Disallow /*?customFilters
Disallow /*?listingCount
Disallow /*?sortKey
Disallow /news/wp-admin/
Disallow /news/?s=
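
As a rough illustration of how the groups above apply to a request, the sketch below picks a group by user-agent token and tests a URL path against that group's Disallow patterns, treating `*` as "any run of characters" and a trailing `$` as an end anchor. The names `GROUPS`, `pattern_to_regex`, `select_group` and `is_disallowed` are hypothetical, the user-agent strings in the usage note are examples only, and the substring match on the user agent is a simplification. Percent-encoding is not normalised, and since the scan lists no Allow rules, no Allow/Disallow precedence is implemented.

```python
import re

# Rules transcribed from the Groups tables above (Disallow only).
GROUPS = {
    "imagesiftbot": ["/"],
    "ahrefsbot": ["/"],
    "*": [
        "/app/", "/bin/", "/dev/", "/lib/", "/phpserver/", "/pkginfo/",
        "/report/", "/setup/", "/update/", "/var/", "/vendor/",
        "/compare/", "/search/", "/mwdownloads/download/link/",
        "/*my-account", "/*catalog/category/view/",
        "/*catalog/product/view/", "/*SID%3D%E2%80%9D",
        "/*?customFilters", "/*?listingCount", "/*?sortKey",
        "/news/wp-admin/", "/news/?s=",
    ],
}


def pattern_to_regex(path):
    """Turn a robots.txt path pattern into a compiled regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def select_group(user_agent):
    """Return the rules of the first named group whose token appears in
    the user agent (case-insensitive), else the '*' group."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_disallowed(user_agent, url_path):
    """True if any Disallow pattern matches from the start of the path."""
    rules = select_group(user_agent)
    return any(pattern_to_regex(p).match(url_path) for p in rules)
```

For example, `is_disallowed("AhrefsBot/7.0", "/")` is true because that group blocks the whole site, while `is_disallowed("ExampleBot", "/printers/?sortKey=price")` is true only because of the `/*?sortKey` pattern in the `*` group.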

Other Records

Field Value
sitemap https://www.printerbase.co.uk/media/sitemap.xml
sitemap https://www.printerbase.co.uk/news/sitemap_index.xml
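
The split of a robots.txt file into the Groups, Other Records and Comments sections of this report can be sketched as follows. The parser `parse_robots` is a hypothetical, simplified reader, not the scanner's actual implementation, and `SAMPLE` is an abbreviated reconstruction from the values listed in this report rather than the file as served.

```python
def parse_robots(text):
    """Split robots.txt text into groups, other records and comments."""
    groups = {}           # user-agent token -> list of (rule, path)
    other = []            # records outside group rules, e.g. sitemap
    comments = []         # text following '#'
    current = []          # user-agent tokens of the group being read
    reading_rules = False
    for raw in text.splitlines():
        line, _, comment = raw.partition("#")
        if comment.strip():
            comments.append(comment.strip())
        if ":" not in line:
            continue
        field, _, value = line.partition(":")
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            # A user-agent line after rules starts a new group.
            if reading_rules:
                current, reading_rules = [], False
            current.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            reading_rules = True
            for agent in current:
                groups[agent].append((field.capitalize(), value))
        else:
            other.append((field, value))
    return groups, other, comments


# Abbreviated reconstruction (assumption), not the served file.
SAMPLE = """\
# Crawlers Setup
User-agent: ahrefsbot
Disallow: /

User-agent: *
# Directories
Disallow: /app/
Disallow: /search/

# Magento Sitemaps
Sitemap: https://www.printerbase.co.uk/media/sitemap.xml
"""

groups, other, comments = parse_robots(SAMPLE)
```

Splitting each line on the first `:` keeps the `https://` in sitemap values intact, since everything after the field name is treated as the value.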

Comments

  • Crawlers Setup
  • Added by PK 23/07/2024
  • Added by PK 08/05/2025
  • Directories
  • Added by PK 07/09/2022
  • Added by PK 27/09/2022
  • Paths
  • Disallow: */checkout Removed due to conflicting CSS files with the same path
  • Parameters to prevent server overload
  • WordPress added by PK 10/03/2025
  • Magento Sitemaps