excite.co.jp
robots.txt
Robots Exclusion Standard data for excite.co.jp
Resource Scan
Scan Details
Site Domain | excite.co.jp |
Base Domain | excite.co.jp |
Scan Status | Ok |
Last Scan | 2024-11-08T07:05:59+00:00 |
Next Scan | 2024-11-15T07:05:59+00:00 |
Last Scan
Scanned | 2024-11-08T07:05:59+00:00 |
URL | https://www.excite.co.jp/robots.txt |
Domain IPs | 54.230.71.115, 54.230.71.24, 54.230.71.34, 54.230.71.53 |
Response IP | 13.35.210.16 |
Found | Yes |
Hash | 978818ff47980672245c6a3d96d534486f037e92f3cb629edd10e4b94e9a44ab |
SimHash | 3405476cc954 |
Groups
*
Rule | Path |
---|---|
Disallow | /search.gw |
Disallow | /world/english/web/body |
Disallow | /world/chinese/web/body |
Disallow | /world/korean/web/body |
Disallow | /world/french/web/body |
Disallow | /world/german/web/body |
Disallow | /world/italian/web/body |
Disallow | /world/spanish/web/body |
Disallow | /world/portuguese/web/body |
Disallow | /world/russian/web/body |
Disallow | /world/english/taiyaku |
Disallow | /relocate/ |
Disallow | /relocate2/ |
Disallow | /relocate3/ |
Disallow | /tool/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.excite.co.jp/news/sitemap.xml |