hensmanhost.com
robots.txt

Robots Exclusion Standard data for hensmanhost.com

Resource Scan

Scan Details

Site Domain hensmanhost.com
Base Domain hensmanhost.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-05-20T23:46:00+00:00
Next Scan 2025-08-18T23:46:00+00:00

Last Successful Scan

Scanned2023-07-24T23:05:23+00:00
URL http://hensmanhost.com/robots.txt
Redirect http://www.hensmandesign.ie//robots.txt
Redirect Domain www.hensmandesign.ie
Redirect Base hensmandesign.ie
Domain IPs 96.45.82.199, 96.45.82.28, 96.45.83.130, 96.45.83.88
Redirect IPs 34.117.168.233
Response IP 34.117.168.233
Found Yes
Hash b70dac1f06a6abaa4b03f244fc7469528b7ec183240b92a62af94b90a37780d1
SimHash 22d28a42d616

Groups

*

Rule Path
Allow /

adsbot-google-mobile
adsbot-google

Rule Path
Disallow /_api/*
Disallow /_partials*
Disallow /pro-gallery-webapp/v1/galleries/*

petalbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.hensmandesign.ie/sitemap.xml

Comments

  • Optimization for Google Ads Bot
  • Block PetalBot
  • Crawl delay for overly enthusiastic bots
  • Auto generated, go to SEO Tools > Robots.txt Editor to change this