nytcrossword.org
robots.txt
Robots Exclusion Standard data for nytcrossword.org
Resource Scan
Scan Details
Site Domain | nytcrossword.org |
Base Domain | nytcrossword.org |
Scan Status | Ok |
Last Scan | 2024-05-13T02:06:20+00:00 |
Next Scan | 2024-05-20T02:06:20+00:00 |
Last Scan
Scanned | 2024-05-13T02:06:20+00:00 |
URL | https://nytcrossword.org/robots.txt |
Domain IPs | 2a01:4ff:f0:e30b::1, 5.161.83.186 |
Response IP | 5.161.83.186 |
Found | Yes |
Hash | 476cd379e931d521b190d565b1071049d7b5345b5985184aca1c015e199575fa |
SimHash | 41019222a64e |
Groups
*
Rule | Path |
---|---|
Allow | /wp-admin/admin-ajax.php |
Allow | /wp-admin/images/ |
Allow | /wp-admin/css/ |
Allow | /wp-admin/js/ |
Disallow | /wp-admin |
Disallow | /wp-includes |
Disallow | /wp-content/plugins |
Disallow | /wp-content/cache |
Disallow | /wp-content/themes |
Disallow | /trackback |
Disallow | */trackback |
Disallow | /*?* |
Disallow | /*? |
Disallow | /?s= |
Disallow | /?replytocom=* |
Disallow | /embed/ |
Disallow | /comments/feed/ |
Disallow | /trackback/ |
Disallow | /?referrer= |