thehiu.com
robots.txt

Robots Exclusion Standard data for thehiu.com

Resource Scan

Scan Details

Site Domain thehiu.com
Base Domain thehiu.com
Scan Status Ok
Last Scan2026-04-08T12:07:08+00:00
Next Scan 2026-04-15T12:07:08+00:00

Last Scan

Scanned2026-04-08T12:07:08+00:00
URL https://thehiu.com/robots.txt
Redirect https://www.thehiu.com/robots.txt
Redirect Domain www.thehiu.com
Redirect Base thehiu.com
Domain IPs 45.252.250.30
Redirect IPs 45.252.250.30
Response IP 45.252.250.30
Found Yes
Hash ba664cb424a4e9ac119d1f707dc82276ce9a9805e6fe01b1a5f2fe416119b135
SimHash 4d208c406fb0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

googlebot

Rule Path
Disallow /*?filter=*
Disallow /*?sort=*
Disallow /*?page=*
Disallow /search?*
Disallow /facets/*

Other Records

Field Value
sitemap https://www.thehiu.com/sitemap_index.xml