internetcom.jp
robots.txt

Robots Exclusion Standard data for internetcom.jp

Resource Scan

Scan Details

Site Domain internetcom.jp
Base Domain internetcom.jp
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-11-08T15:14:00+00:00
Next Scan 2025-02-06T15:14:00+00:00

Last Successful Scan

Scanned 2024-07-12T07:45:20+00:00
URL https://internetcom.jp/robots.txt
Domain IPs 13.114.180.139, 18.180.200.85
Response IP 18.180.200.85
Found Yes
Hash a939595844254ff4bdf7135856d4be5d3cb2d7f3e8a2abe2ce064a169c082b92
SimHash 4d04886165d1

Groups

*

Rule Path
Disallow /click/
Disallow /ad/
Disallow /smartnews_prs/

*
bingbot

Rule Path
Disallow /news/
Allow /news/article/
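
The interaction between the Disallow /news/ and Allow /news/article/ rules depends on how a crawler resolves overlapping paths. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the longest-match precedence described in RFC 9309, where the most specific matching rule wins and Allow wins a tie. It also assumes that the two groups, which both name "*", are merged into one rule list. The names RULES and is_allowed are hypothetical, and only plain prefix matching is handled, since none of the listed paths use wildcards.

```python
# Rules transcribed from the two groups listed above. Both groups name "*",
# so this sketch merges them into a single list (an assumption, see lead-in).
RULES = [
    ("disallow", "/click/"),
    ("disallow", "/ad/"),
    ("disallow", "/smartnews_prs/"),
    ("disallow", "/news/"),
    ("allow", "/news/article/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for candidate in ("/", "/news/", "/news/article/123", "/click/abc"):
        print(candidate, is_allowed(candidate))
```

Under these assumptions, article pages below /news/article/ stay crawlable while the rest of /news/ is blocked. Parsers that apply the first matching rule instead of the longest one may reach a different result for the same paths.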

Other Records

Field Value
sitemap https://internetcom.jp/xml/sitemap_index.xml
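
To show how the groups and other records above relate to the underlying file, here is a rough sketch of a parser that splits robots.txt text into user-agent groups and non-group records. The ROBOTS_TXT string is a hypothetical reconstruction from the tables in this report. The real file's exact contents (comments, blank lines, letter case) are not known, so it will not reproduce the hash listed above. The function name parse_robots is likewise an assumption made for illustration.

```python
# Hypothetical reconstruction of the file from the tables in this report.
# The real file's exact bytes are unknown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /click/
Disallow: /ad/
Disallow: /smartnews_prs/

User-agent: *
User-agent: bingbot
Disallow: /news/
Allow: /news/article/

Sitemap: https://internetcom.jp/xml/sitemap_index.xml
"""


def parse_robots(text):
    """Split robots.txt text into (groups, other_records)."""
    groups, other = [], []
    current = None
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            # Consecutive user-agent lines share one group; a user-agent
            # line that follows rules starts a new group.
            if current is None or current["rules"]:
                current = {"agents": [], "rules": []}
                groups.append(current)
            current["agents"].append(value)
        elif field in ("allow", "disallow"):
            if current is not None:
                current["rules"].append((field, value))
        else:
            other.append((field, value))  # e.g. sitemap records
    return groups, other


if __name__ == "__main__":
    parsed_groups, parsed_other = parse_robots(ROBOTS_TXT)
    for group in parsed_groups:
        print(group["agents"], group["rules"])
    print(parsed_other)
```

The sitemap line is kept apart from the groups because it applies to the whole site rather than to a particular user agent, which matches its placement under Other Records here.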