hanahaki.com
robots.txt

Robots Exclusion Standard data for hanahaki.com

Resource Scan

Scan Details

Site Domain hanahaki.com
Base Domain hanahaki.com
Scan Status Ok
Last Scan2024-10-26T17:16:43+00:00
Next Scan 2024-11-02T17:16:43+00:00

Last Scan

Scanned2024-10-26T17:16:43+00:00
URL https://hanahaki.com/robots.txt
Redirect https://www.hanahaki.com/robots.txt
Redirect Domain www.hanahaki.com
Redirect Base hanahaki.com
Domain IPs 104.21.60.199, 172.67.201.21, 2606:4700:3030::6815:3cc7, 2606:4700:3036::ac43:c915
Redirect IPs 104.21.60.199, 172.67.201.21, 2606:4700:3030::6815:3cc7, 2606:4700:3036::ac43:c915
Response IP 104.21.60.199
Found Yes
Hash f4919629b28427a3f31b51997c858aecfe319efe4b1e3d2232d9f3cf51c2f662
SimHash 43100a0286b8

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /e/
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-json/
Disallow /show-error-*
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /readme.html
Allow /*.css
Allow /*.js
Allow /wp-content/uploads/

Other Records

Field Value
sitemap https://www.hanahaki.com/sitemap.xml

Warnings

  • `host` is not a known field.