wtopnews.com
robots.txt
Robots Exclusion Standard data for wtopnews.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wtopnews.com |
Base Domain | wtopnews.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-09-17T11:44:41+00:00 |
Next Scan | 2024-12-16T11:44:41+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2022-11-05T10:30:03+00:00 |
URL | http://wtopnews.com/robots.txt |
Redirect | https://wtop.com/robots.txt |
Redirect Domain | wtop.com |
Redirect Base | wtop.com |
Response IP | 151.101.130.217, 151.101.2.217, 151.101.194.217, 151.101.66.217 |
Found | Yes |
Hash | 084494bb205292ccca7823a8f772fa93913e46b73c491979e255f4591d2c851b |
SimHash | 012048298b31 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /crossdomain.xml |
Disallow | /eyeblaster/ |
Disallow | /*/mraid.js |
Disallow | /plugins/ |
Disallow | /search/$ |
Disallow | /search$ |
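
The rules in this group can be hard to read at a glance because they mix an Allow override with `*` wildcards and `$` end anchors. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and are not part of the scan data; pattern length is measured in characters, and percent-encoding normalization is left out for brevity.

```python
import re

# Rules copied from the group table above, in listed order.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/crossdomain.xml"),
    ("disallow", "/eyeblaster/"),
    ("disallow", "/*/mraid.js"),
    ("disallow", "/plugins/"),
    ("disallow", "/search/$"),
    ("disallow", "/search$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; on equal length,
    Allow beats Disallow. A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path (prefix match).
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/wp-admin/options.php", "/wp-admin/admin-ajax.php",
              "/ads/mraid.js", "/search", "/search/", "/search/results"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because the Allow pattern is longer than `/wp-admin/`, while the two `$`-anchored rules block only the bare `/search` and `/search/` paths and leave deeper paths such as `/search/results` unaffected. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` or `$` in paths.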
Other Records
Field | Value |
---|---|
sitemap | https://wtop.com/wtop_sitemap_index.xml |