theactivetimes.net
robots.txt
Robots Exclusion Standard data for theactivetimes.net
Resource Scan
Scan Details
Site Domain | theactivetimes.net |
Base Domain | theactivetimes.net |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-04-02T10:04:32+00:00 |
Next Scan | 2024-07-01T10:04:32+00:00 |
Last Successful Scan
Scanned | 2023-03-10T05:09:56+00:00 |
URL | https://theactivetimes.net/robots.txt |
Redirect | https://www.explore.com:443/robots.txt |
Redirect Domain | www.explore.com |
Redirect Base | explore.com |
Domain IPs | 23.185.0.1, 2620:12a:8000::1, 2620:12a:8001::1 |
Redirect IPs | 13.33.88.14, 13.33.88.52, 13.33.88.75, 13.33.88.93 |
Response IP | 13.33.21.6 |
Found | Yes |
Hash | e3d4a0c751bb25b90a0b31a25ba338f1ede1a889d36252fcb1da7dbf001655a3 |
SimHash | f00458482991 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /*?*ajax= |
Disallow | /*?*zergnet |
Disallow | /*/s/* |
Disallow | /*/sl/* |
Disallow | /search/ |
Disallow | *mode%3Dprint |
Disallow | *mode%3Dgn |
Other Records
Field | Value |
---|---|
sitemap | https://www.explore.com/sitemap_index.xml |
sitemap | https://www.explore.com/stories/sitemap-index.xml |
sitemap | https://www.explore.com/?getfeed=google |