hiroshi-shih.com
robots.txt

Robots Exclusion Standard data for hiroshi-shih.com

Resource Scan

Scan Details

Site Domain hiroshi-shih.com
Base Domain hiroshi-shih.com
Scan Status Ok
Last Scan 2025-08-25T11:21:31+00:00
Next Scan 2025-09-24T11:21:31+00:00

Last Scan

Scanned 2025-08-25T11:21:31+00:00
URL https://hiroshi-shih.com/robots.txt
Redirect https://www.hiroshi-shih.com/robots.txt
Redirect Domain www.hiroshi-shih.com
Redirect Base hiroshi-shih.com
Domain IPs 199.34.228.73
Redirect IPs 199.34.228.73
Response IP 199.34.228.73
Found Yes
Hash be7ea7e43fee840e4131a69c4b66faa0a4305e8ea0763451e10a624d26fcc6ec
SimHash 4a54dc726793
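
The Hash value above is 64 hexadecimal characters, which matches the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The helper names `body_digest` and `has_changed` are hypothetical and serve only to illustrate change detection between scans.

```python
import hashlib

# Value copied from the "Hash" row of the scan report.
RECORDED_HASH = "be7ea7e43fee840e4131a69c4b66faa0a4305e8ea0763451e10a624d26fcc6ec"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    SHA-256 is an assumption based on the digest length only.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True when the digest of `body` differs from the recorded hash."""
    return body_digest(body) != recorded.lower()
```

A scanner could call `has_changed` on a freshly fetched body and skip re-parsing when it returns False. A mismatch against the recorded value may also simply mean that the assumption about the algorithm or the hashed bytes is wrong.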

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /http%3A//store.hiroshi-shih.com/
Disallow /http%3A//hiroshi-shih.blogspot.com

Other Records

Field Value
sitemap https://www.hiroshi-shih.com/sitemap.xml
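
The groups listed above can be checked against concrete URLs with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt body from the reported rules. The exact layout and ordering of the original file are an assumption, and `examplebot` is a hypothetical user agent standing in for any crawler that falls under the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in the report; the real file's exact
# layout and ordering are assumed, not copied.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /http%3A//store.hiroshi-shih.com/
Disallow: /http%3A//hiroshi-shih.blogspot.com

Sitemap: https://www.hiroshi-shih.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.hiroshi-shih.com"

# nerdybot is blocked from the whole site.
nerdybot_home = parser.can_fetch("nerdybot", BASE + "/")

# dotbot has its own group with no Disallow rules, so the "*" rules
# do not apply to it; only the crawl delay does.
dotbot_apps = parser.can_fetch("dotbot", BASE + "/apps/")
dotbot_delay = parser.crawl_delay("dotbot")

# Any other agent (hypothetical "examplebot") falls back to the "*" group.
other_apps = parser.can_fetch("examplebot", BASE + "/apps/page")
other_about = parser.can_fetch("examplebot", BASE + "/about")

# Sitemap records are global rather than tied to a group (Python 3.8+).
sitemaps = parser.site_maps()
```

Because a crawler obeys only the most specific group that matches it, dotbot may fetch `/apps/` even though the `*` group disallows that path.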