
Robots Exclusion Standard data for edh.tw

Resource Scan

Scan Details

Site Domain edh.tw
Base Domain edh.tw
Scan Status Ok
Last Scan 2024-05-12T03:21:26+00:00
Next Scan 2024-05-19T03:21:26+00:00

Last Scan

Scanned 2024-05-12T03:21:26+00:00
URL https://edh.tw/robots.txt
Redirect https://www.edh.tw/robots.txt
Redirect Domain www.edh.tw
Redirect Base edh.tw
Domain IPs 13.225.4.10, 13.225.4.103, 13.225.4.13, 13.225.4.17, 2600:9000:21b4:200:b:b3f0:1100:93a1, 2600:9000:21b4:3400:b:b3f0:1100:93a1, 2600:9000:21b4:8e00:b:b3f0:1100:93a1, 2600:9000:21b4:a800:b:b3f0:1100:93a1, 2600:9000:21b4:ac00:b:b3f0:1100:93a1, 2600:9000:21b4:b600:b:b3f0:1100:93a1, 2600:9000:21b4:c00:b:b3f0:1100:93a1, 2600:9000:21b4:e600:b:b3f0:1100:93a1
Redirect IPs 13.33.88.75, 13.33.88.78, 13.33.88.79, 13.33.88.92, 2600:9000:223b:1c00:3:f659:d580:93a1, 2600:9000:223b:4000:3:f659:d580:93a1, 2600:9000:223b:8200:3:f659:d580:93a1, 2600:9000:223b:8e00:3:f659:d580:93a1, 2600:9000:223b:a000:3:f659:d580:93a1, 2600:9000:223b:be00:3:f659:d580:93a1, 2600:9000:223b:e200:3:f659:d580:93a1, 2600:9000:223b:ee00:3:f659:d580:93a1
Response IP 13.33.88.92
Found Yes
Hash 109981745b0ddf13d71cd604e274bfc7168bad8876d86104b887889aac12b46c
SimHash cb05cc468193
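
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under that assumption: it shows how a fingerprint of the same shape could be computed from the raw robots.txt bytes in order to detect changes between scans. The function name `fingerprint` is hypothetical, and the SimHash field is not covered because its algorithm is not stated.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's "Hash" field is a SHA-256 digest of the raw
    bytes. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Two scans return the same digest only if the bodies are byte-identical.
previous = fingerprint(b"User-agent: *\nAllow: /\n")
current = fingerprint(b"User-agent: *\nAllow: /\nDisallow: /admin/\n")
changed = previous != current
```

Comparing the digest from one scan with the digest from the next is enough to tell whether the file changed, without storing the full body.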

Groups

yisouspider

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /admin/
Disallow /plugins/feedback.php
Disallow /epaper.do
Disallow /iframearticle.do
Disallow /m/ads/
Disallow /resources/ad/
Disallow /previews/
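
The groups above can be read as a small decision procedure: pick the group whose user-agent token matches the crawler, falling back to `*`, then apply the rule with the longest matching path. The sketch below follows that longest-match convention (as described in RFC 9309) using the rules transcribed from this report. The names `GROUPS`, `select_group`, and `is_allowed` are illustrative, and the substring-based user-agent matching is a simplification. Wildcards (`*`, `$`) in paths are not handled, since none appear in the rules listed here.

```python
# Rule groups transcribed from the report: user-agent token -> [(rule, path)].
GROUPS = {
    "yisouspider": [("disallow", "/")],
    "megaindex": [("disallow", "/")],
    "*": [
        ("allow", "/"),
        ("disallow", "/admin/"),
        ("disallow", "/plugins/feedback.php"),
        ("disallow", "/epaper.do"),
        ("disallow", "/iframearticle.do"),
        ("disallow", "/m/ads/"),
        ("disallow", "/resources/ad/"),
        ("disallow", "/previews/"),
    ],
}


def select_group(user_agent):
    """Pick the named group whose token occurs in the user agent, else '*'."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent, path):
    """Longest matching path prefix wins; on a tie, allow wins."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for rule, prefix in select_group(user_agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and rule == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = rule == "allow"
    return allowed
```

Under these rules a generic crawler may fetch ordinary pages but not `/admin/` or `/previews/`, while yisouspider and megaindex are blocked from the whole site. Python's standard `urllib.robotparser` applies rules in file order rather than by longest match, so with `Allow /` listed first it may report `/admin/` as fetchable; that is the reason for the hand-written matcher here.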

Other Records

Field Value
sitemap https://www.edh.tw/upload/all_rss.xml
sitemap https://www.edh.tw/upload/tag_sitemap.xml
sitemap https://www.edh.tw/upload/rss.xml
sitemap https://www.edh.tw/upload/sitemap.xml
sitemap https://www.edh.tw/upload/googlenews.xml.gz
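
Sitemap records like the ones above sit outside any user-agent group, so they can be collected with a single pass over the file. The following is a minimal sketch of that extraction. The `SAMPLE` text is a partial reconstruction for illustration only, not the verbatim file served by the site, and `extract_sitemaps` is a hypothetical helper name.

```python
def extract_sitemaps(robots_text):
    """Return the values of all 'Sitemap' fields, in file order."""
    sitemaps = []
    for line in robots_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = line.split(":", 1)  # split only at the first colon
        if field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps


# Illustrative reconstruction; the real file's layout may differ.
SAMPLE = """User-agent: *
Allow: /
Disallow: /admin/
Sitemap: https://www.edh.tw/upload/sitemap.xml
Sitemap: https://www.edh.tw/upload/googlenews.xml.gz
"""

found = extract_sitemaps(SAMPLE)
```

Splitting only at the first colon keeps the `https://` part of each URL intact, and field names are compared case-insensitively.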