pcmag.com
robots.txt

Robots Exclusion Standard data for pcmag.com

Resource Scan

Scan Details

Site Domain pcmag.com
Base Domain pcmag.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-02-23T21:56:24+00:00
Next Scan 2024-05-23T21:56:24+00:00

Last Successful Scan

Scanned2023-10-27T19:22:46+00:00
URL https://pcmag.com/robots.txt
Redirect https://www.pcmag.com/robots.txt
Redirect Domain www.pcmag.com
Redirect Base pcmag.com
Domain IPs 104.16.122.17, 104.16.123.17, 2606:4700::6810:7a11, 2606:4700::6810:7b11
Redirect IPs 104.16.122.17, 104.16.123.17, 2606:4700::6810:7a11, 2606:4700::6810:7b11
Response IP 104.16.123.17
Found Yes
Hash 30160375cb58ddce3d61a98bb394451237fc1674000865a3f9d9b08dbf2024f2
SimHash 691cd84ae713

Groups

*

Rule Path
Disallow /search/
Disallow /archive/
Disallow /otc/
Disallow /api/
Disallow /cdn-cgi/
Disallow /*?page=%5B0-9%5D%5B0-9%5D
Disallow /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D
Disallow /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D%5B0-9%5D
Allow /*?page=%5B0-9%5D

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pcmag.com/sitemap-index.xml
sitemap https://www.pcmag.com/sitemap-google-news.xml