/.well-known/

Log In Sign Up

jihouse.com.tw
robots.txt

Robots Exclusion Standard data for jihouse.com.tw

Archived Snapshots

Resource Scan

Scan Details

Site Domain	jihouse.com.tw
Base Domain	jihouse.com.tw
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-04-13T07:36:10+00:00
Next Scan	2024-07-12T07:36:10+00:00

Last Successful Scan

Scanned	2023-05-27T07:34:12+00:00
URL	https://jihouse.com.tw/robots.txt
Redirect	https://www.jihouse.com.tw/robots.txt
Redirect Domain	www.jihouse.com.tw
Redirect Base	jihouse.com.tw
Domain IPs	15.197.142.173, 3.33.152.147, 54.183.102.22
Redirect IPs	18.181.31.166, 54.248.227.74
Response IP	18.176.133.53
Found	Yes
Hash	c48a5330bacb24396a981dc27052eac8164481503dcded9c8b7c4b0dc195bf21
SimHash	ba8d6fa56450

Groups

semrushbot

Rule

Path

Disallow

/

blackwidow

Rule

Path

Disallow

/

Back to top

Other Records

Field

Value

sitemap

https://www.jihouse.com.tw/sitemap.xml

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /

Back to top