news18.tw
robots.txt

Robots Exclusion Standard data for news18.tw

Resource Scan

Scan Details

Site Domain news18.tw
Base Domain news18.tw
Scan Status Ok
Last Scan2024-05-07T21:01:17+00:00
Next Scan 2024-05-14T21:01:17+00:00

Last Scan

Scanned2024-05-07T21:01:17+00:00
URL https://news18.tw/robots.txt
Domain IPs 104.21.38.76, 172.67.220.21, 2606:4700:3033::6815:264c, 2606:4700:3035::ac43:dc15
Response IP 104.21.38.76
Found Yes
Hash d9333f4c59da446edd57de246071567ad3de3afbd14ed10e4b7dee55bbd1a363
SimHash a0011c01c9f3

Groups

*

Rule Path
Disallow /admin*
Disallow /template*

Other Records

Field Value
sitemap /sitemap/amp/index.xml
sitemap /sitemap/mobile/index.xml
sitemap /sitemap/pc/index.xml