h-in-c.com
robots.txt

Robots Exclusion Standard data for h-in-c.com

Resource Scan

Scan Details

Site Domain h-in-c.com
Base Domain h-in-c.com
Scan Status Ok
Last Scan 2025-08-06T18:15:32+00:00
Next Scan 2025-09-05T18:15:32+00:00

Last Scan

Scanned 2025-08-06T18:15:32+00:00
URL https://h-in-c.com/robots.txt
Domain IPs 183.111.139.224, 183.111.139.236, 203.245.12.122, 203.245.12.99
Response IP 203.245.12.122
Found Yes
Hash 95077f26167222fc220a06cc1164f25648d38f6731a3703fa6c18b20f220bce5
SimHash 6d113232e603

Groups

*

Rule Path
Disallow /admin/
Disallow /api/
Disallow /order/
Disallow /basket/
Disallow /checkout/
Disallow /myshop/
Disallow /login/
Disallow /member/
Disallow /search.html?*
Disallow /*?*sort=
Disallow /*?*filter=
Allow /
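
The rules in the * group can be checked against a path with a short script. The following is a minimal Python sketch, assuming the longest-match precedence and * wildcard handling described in RFC 9309; the names RULES, rule_to_regex and is_allowed are hypothetical, and individual crawlers may interpret these patterns differently.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/api/"),
    ("disallow", "/order/"),
    ("disallow", "/basket/"),
    ("disallow", "/checkout/"),
    ("disallow", "/myshop/"),
    ("disallow", "/login/"),
    ("disallow", "/member/"),
    ("disallow", "/search.html?*"),
    ("disallow", "/*?*sort="),
    ("disallow", "/*?*filter="),
    ("allow", "/"),
]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else, including '?', is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if rule_to_regex(pattern).match(path):
            length = len(pattern)
            # The longer pattern wins; on a tie, allow wins.
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/admin/users", "/product/list.html", "/product/list.html?sort=price"]:
        print(p, is_allowed(p))
```

Under these assumptions, /search.html without a query string stays crawlable, because the pattern /search.html?* requires a literal ? in the path.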

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

yeti

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
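
The per-agent groups above can be read as a lookup: a crawler that finds a group naming it uses that group alone, and only falls back to * otherwise. The following is a minimal Python sketch of that selection, assuming simple case-insensitive substring matching on the user-agent string; GROUPS, select_group and crawl_delay_for are hypothetical names, and crawl-delay is a non-standard field that not every crawler honors.

```python
# Groups as listed in the scan. "disallow" holds the Disallow paths of a group,
# "crawl_delay" the crawl-delay value in seconds (None when the field is absent).
GROUPS = {
    "*": {
        "disallow": [
            "/admin/", "/api/", "/order/", "/basket/", "/checkout/",
            "/myshop/", "/login/", "/member/", "/search.html?*",
            "/*?*sort=", "/*?*filter=",
        ],
        "crawl_delay": None,
    },
    "googlebot": {"disallow": [], "crawl_delay": 1},
    "yeti": {"disallow": [], "crawl_delay": 2},
    "bingbot": {"disallow": [], "crawl_delay": 5},
}


def select_group(user_agent):
    """Return the name of the group that applies to this user-agent string.

    Assumption: a named group applies when its name occurs in the
    lowercased user-agent string; otherwise the "*" group applies.
    """
    token = user_agent.lower()
    for name in GROUPS:
        if name != "*" and name in token:
            return name
    return "*"


def crawl_delay_for(user_agent):
    """Return the crawl-delay of the selected group, or None if not set."""
    return GROUPS[select_group(user_agent)]["crawl_delay"]


if __name__ == "__main__":
    for ua in ["Googlebot/2.1", "Yeti/1.1", "bingbot/2.0", "DuckDuckBot/1.1"]:
        group = select_group(ua)
        print(ua, group, crawl_delay_for(ua), len(GROUPS[group]["disallow"]))
```

Read this way, the named groups carry a crawl-delay but no Disallow rules, so the restrictions in the * group do not apply to googlebot, yeti or bingbot.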

Other Records

Field Value
sitemap https://h-in-c.com/sitemap.xml