static01.nyt.com
robots.txt
Robots Exclusion Standard data for static01.nyt.com
Resource Scan
Scan Details
Site Domain | static01.nyt.com |
Base Domain | nyt.com |
Scan Status | Ok |
Last Scan | 2024-06-17T02:41:33+00:00 |
Next Scan | 2024-07-01T02:41:33+00:00 |
Last Scan
Scanned | 2024-06-17T02:41:33+00:00 |
URL | https://static01.nyt.com/robots.txt |
Domain IPs | 151.101.1.164, 151.101.129.164, 151.101.193.164, 151.101.65.164 |
Response IP | 199.232.45.164 |
Found | Yes |
Hash | 3c468117ad78918361661207f9242920cec7839841ebb9637df2dc2fe549db1b |
SimHash | 7d41111a2775 |
Groups
*
Rule | Path |
---|---|
Disallow | /pages/college/ |
Disallow | /college/ |
Disallow | /library/ |
Disallow | /learning/ |
Disallow | /aponline/ |
Disallow | /reuters/ |
Disallow | /cnet/ |
Disallow | /partners/ |
Disallow | /archives/ |
Disallow | /indexes/ |
Disallow | /adx/bin/ |
Disallow | /thestreet/ |
Disallow | /nytimes-partners/ |
Disallow | /financialtimes/ |
Disallow | /email-content/ |
Allow | /pages/ |
Allow | /2003/ |
Allow | /2004/ |
Allow | /2005/ |
Allow | /top/ |
Allow | /ref/ |
Allow | /services/xml/ |
Comments