asahi.co.jp
robots.txt

Robots Exclusion Standard data for asahi.co.jp

Resource Scan

Scan Details

Site Domain asahi.co.jp
Base Domain asahi.co.jp
Scan Status Ok
Last Scan2024-10-18T12:16:18+00:00
Next Scan 2024-10-25T12:16:18+00:00

Last Scan

Scanned2024-10-18T12:16:18+00:00
URL https://asahi.co.jp/robots.txt
Domain IPs 2600:9000:2047:9000:0:dccb:c640:93a1, 2600:9000:2047:e600:0:dccb:c640:93a1, 54.230.129.42, 54.230.129.81
Response IP 54.230.129.42
Found Yes
Hash be2646270cc4c9a22acf1a5f7e5627d923e2874171331eee778ad22c68e1de21
SimHash 5139cb61df30

Groups

*

Rule Path
Disallow /data/gtop/test_allpage_crawler.html
Disallow /data/gtop/link_crawler.html

Comments

  • 2017.08.15 更新
  • å
  • Allow: /

Warnings

  • 1 invalid line.