ww.itimes.com
robots.txt
Robots Exclusion Standard data for ww.itimes.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ww.itimes.com |
Base Domain | itimes.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a server error. |
Last Scan | 2024-08-29T20:00:05+00:00 |
Next Scan | 2024-11-27T20:00:05+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-10-12T02:24:04+00:00 |
URL | http://ww.itimes.com/robots.txt |
Domain IPs | 125.56.219.32, 96.17.72.80 |
Response IP | 42.99.140.145 |
Found | Yes |
Hash | 5e761fb299e79e647ea8983ca70fc4e83c74bc2ba1a4902d8a3ea39229cc2591 |
SimHash | 680588028392 |
Groups
*
Rule | Path |
---|---|
Disallow | /*/*/ajax |
Disallow | /*?* |
Disallow | /esi-home |
Disallow | /citizen-journalism/how-an-inorganic-semiconductor-photovoltaic-works |
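
The Disallow rules above use the `*` wildcard, so a plain prefix comparison is not enough to tell which paths they cover. The following is a minimal Python sketch of how a crawler might evaluate them, assuming the wildcard semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule otherwise matches as a prefix). The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are hypothetical and are not part of the scan data. The sketch also assumes the group contains no Allow rules, as the table shows only Disallow entries.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW_RULES = [
    "/*/*/ajax",
    "/*?*",
    "/esi-home",
    "/citizen-journalism/how-an-inorganic-semiconductor-photovoltaic-works",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics (RFC 9309 style): '*' matches any sequence of
    characters, a trailing '$' anchors the end of the path, and the
    pattern is otherwise treated as a prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces, then join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the path (plus query)."""
    # re.match anchors at the start of the string, which gives the
    # prefix behaviour; the end is left open unless the rule ended in '$'.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    for candidate in ["/esi-home", "/news/today/ajax", "/search?q=1", "/about"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/*?*` blocks every URL that carries a query string, and `/*/*/ajax` blocks any path with `/ajax` at the third segment or deeper. A hand-written matcher is used here because Python's standard `urllib.robotparser` appears to compare rule paths as plain prefixes and would not expand the `*` in these rules.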