nyt.ru
robots.txt
Robots Exclusion Standard data for nyt.ru
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | nyt.ru |
Base Domain | nyt.ru |
Scan Status | Ok |
Last Scan | 2024-04-25T11:49:33+00:00 |
Next Scan | 2024-05-25T11:49:33+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-04-25T11:49:33+00:00 |
URL | https://nyt.ru/robots.txt |
Domain IPs | 91.227.198.103 |
Response IP | 91.227.198.103 |
Found | Yes |
Hash | e83793b9b5b6d09eb03650c760179e990243a75a9e2c5513116dd5ee9da22ff0 |
SimHash | 080b81d6ae95 |
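
The report does not say how the Hash value is computed. Its 64 hexadecimal digits are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `sha256_hex` is hypothetical, and the SimHash value is not covered here.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw bytes of robots.txt; the
    report itself does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()
```

If the assumption holds, running `sha256_hex` on the bytes served at the URL above at scan time would reproduce the Hash value; a different result would mean the file changed or the scanner uses another scheme.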
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /manager/ |
Disallow | /assets/components/ |
Disallow | /core/ |
Disallow | /connectors/ |
Disallow | /index.php |
Disallow | /cabinet/ |
Disallow | /*? |
Disallow | /*index/ |
Disallow | /politika-konfidenczialnosti/ |
Disallow | /corp-price/ |
Allow | /*.js |
Allow | /*.css |
Allow | /*.jpeg |
Allow | /*.png |
Allow | /*.svg |
Allow | /*?page= |
Allow | /faq/index/ |
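
Because several Allow and Disallow patterns above overlap (for example `/*?` and `/*?page=`), it helps to see how a crawler might resolve them. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309: the matching pattern with the most characters wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and percent-encoding normalization is omitted. Individual crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/manager/"),
    ("disallow", "/assets/components/"),
    ("disallow", "/core/"),
    ("disallow", "/connectors/"),
    ("disallow", "/index.php"),
    ("disallow", "/cabinet/"),
    ("disallow", "/*?"),
    ("disallow", "/*index/"),
    ("disallow", "/politika-konfidenczialnosti/"),
    ("disallow", "/corp-price/"),
    ("allow", "/*.js"),
    ("allow", "/*.css"),
    ("allow", "/*.jpeg"),
    ("allow", "/*.png"),
    ("allow", "/*.svg"),
    ("allow", "/*?page="),
    ("allow", "/faq/index/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1      # length of the most specific matching pattern so far
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len = len(pattern)
                best_allow = allow
    return best_allow
```

Under these assumptions, `is_allowed("/catalog/?page=2")` is `True` because `/*?page=` is longer than `/*?`, while `is_allowed("/catalog/?sort=price")` is `False`. Note that `is_allowed("/assets/components/app.js")` is also `False`: the `/assets/components/` rule is longer than `/*.js`, so the Allow for scripts does not override it. The paths in these examples are illustrative and not taken from the site.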
Other Records
Field | Value |
---|---|
sitemap | https://nyt.ru/sitemap.xml |
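
The split between Groups and Other Records can be illustrated with a small parser. This is a minimal sketch, assuming the usual `field: value` line syntax of robots.txt; the function name `parse_robots` is hypothetical and this is not the scanner's actual implementation. Rules are collected per user-agent, and any other field, such as `sitemap`, is kept as a separate record.

```python
def parse_robots(text):
    """Split robots.txt text into per-agent rule groups and other records."""
    groups = {}             # user-agent token -> list of (rule, path)
    other = []              # (field, value) pairs outside groups, e.g. sitemap
    current_agents = []     # agents the next rules apply to
    last_was_agent = False  # consecutive user-agent lines share one group

    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if not last_was_agent:
                current_agents = []  # a new group starts here
            current_agents.append(value)
            groups.setdefault(value, [])
            last_was_agent = True
        elif field in ("allow", "disallow"):
            for agent in current_agents:
                groups[agent].append((field, value))
            last_was_agent = False
        else:
            other.append((field, value))
    return groups, other
```

Fed a file whose content matches the tables in this report, the sketch would return one group keyed by `*` holding the Allow and Disallow rules, plus a single `("sitemap", "https://nyt.ru/sitemap.xml")` record.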