pcmag.com
robots.txt
Robots Exclusion Standard data for pcmag.com
Resource Scan
Scan Details
Site Domain | pcmag.com |
Base Domain | pcmag.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-02-23T21:56:24+00:00 |
Next Scan | 2024-05-23T21:56:24+00:00 |
Last Successful Scan
Scanned | 2023-10-27T19:22:46+00:00 |
URL | https://pcmag.com/robots.txt |
Redirect | https://www.pcmag.com/robots.txt |
Redirect Domain | www.pcmag.com |
Redirect Base | pcmag.com |
Domain IPs | 104.16.122.17, 104.16.123.17, 2606:4700::6810:7a11, 2606:4700::6810:7b11 |
Redirect IPs | 104.16.122.17, 104.16.123.17, 2606:4700::6810:7a11, 2606:4700::6810:7b11 |
Response IP | 104.16.123.17 |
Found | Yes |
Hash | 30160375cb58ddce3d61a98bb394451237fc1674000865a3f9d9b08dbf2024f2 |
SimHash | 691cd84ae713 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /archive/ |
Disallow | /otc/ |
Disallow | /api/ |
Disallow | /cdn-cgi/ |
Disallow | /*?page=%5B0-9%5D%5B0-9%5D |
Disallow | /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D |
Disallow | /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D%5B0-9%5D |
Allow | /*?page=%5B0-9%5D |
Other Records
Field | Value |
---|---|
sitemap | https://www.pcmag.com/sitemap-index.xml |
sitemap | https://www.pcmag.com/sitemap-google-news.xml |