jp.reuters.com
robots.txt
Robots Exclusion Standard data for jp.reuters.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | jp.reuters.com |
Base Domain | reuters.com |
Scan Status | Ok |
Last Scan | 2024-06-27T02:09:01+00:00 |
Next Scan | 2024-07-04T02:09:01+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-27T02:09:01+00:00 |
URL | https://jp.reuters.com/robots.txt |
Domain IPs | 104.88.70.33, 104.88.70.8, 2600:1413:b000:14::b857:c145, 2600:1413:b000:14::b857:c150 |
Response IP | 125.56.219.75 |
Found | Yes |
Hash | 0e8e0fd257fcbb1d7b67edd156b1caf2bfecb04b7c218c8101ba1f11bb55a32c |
SimHash | 140a99408d92 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /site-search/ |
Disallow | /test/ |
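The two Disallow rules can be checked against candidate paths with Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt lines are reconstructed from the table rather than copied from the live file, and `is_allowed` and the `example-bot` agent name are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table for the "*" group. The exact text of the
# live file (ordering, blank lines, comments) is an assumption.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /site-search/",
    "Disallow: /test/",
]


def is_allowed(path: str, agent: str = "example-bot") -> bool:
    """Return True if `agent` may fetch `path` on jp.reuters.com.

    `agent` is a hypothetical crawler name; it falls under the "*" group
    because no more specific group is listed.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch(agent, "https://jp.reuters.com" + path)


if __name__ == "__main__":
    for candidate in ("/site-search/?q=x", "/test/page", "/testing/", "/world/"):
        print(candidate, is_allowed(candidate))
```

The standard-library parser matches rules by path prefix, so `/testing/` is not covered by `Disallow: /test/`, while anything under `/test/` is.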
Other Records
Field | Value |
---|---|
sitemap | https://jp.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml |
sitemap | https://jp.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml |
sitemap | https://jp.reuters.com/static/video-sitemap/jp/sitemap_video_index.xml |
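The sitemap records can be read back out of a robots.txt body with `RobotFileParser.site_maps()` (available since Python 3.8). The sketch below assumes the file text can be reconstructed from the tables on this page; `list_sitemaps` is a hypothetical helper name, not part of any library.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the tables on this page; the layout of the live file
# is an assumption.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /site-search/",
    "Disallow: /test/",
    "Sitemap: https://jp.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml",
    "Sitemap: https://jp.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml",
    "Sitemap: https://jp.reuters.com/static/video-sitemap/jp/sitemap_video_index.xml",
]


def list_sitemaps(lines: list[str]) -> list[str]:
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in list_sitemaps(ROBOTS_LINES):
        print(url)
```

Sitemap records are independent of user-agent groups, so they are returned regardless of which crawler is asking.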