wnbc.com
robots.txt
Robots Exclusion Standard data for wnbc.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wnbc.com |
Base Domain | wnbc.com |
Scan Status | Ok |
Last Scan | 2024-11-13T08:17:54+00:00 |
Next Scan | 2024-11-20T08:17:54+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-13T08:17:54+00:00 |
URL | https://wnbc.com/robots.txt |
Redirect | https://www.nbcnewyork.com/robots.txt |
Redirect Domain | www.nbcnewyork.com |
Redirect Base | nbcnewyork.com |
Domain IPs | 192.0.66.2 |
Redirect IPs | 23.203.73.218 |
Response IP | 23.15.97.106 |
Found | Yes |
Hash | 329691e34e7e35839235ea9d076a0bef2c2c47004b2f9a0cdb5471c8b4b831e7 |
SimHash | 6dfcd033cec7 |
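
The report does not say which algorithm produces the Hash value. Its 64 hexadecimal characters are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body. That assumption is unverified, and the function names `content_hash` and `has_changed` are hypothetical. The idea is only to show how a stored digest could be used to detect a changed robots.txt between scans; the SimHash value is not covered.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the digest from the last scan."""
    return content_hash(body) != previous_hash.lower()
```

An exact digest changes on any byte-level difference, including whitespace, so it reports whether the file changed but gives no indication of how much.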
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Allow | /.well-known/amphtml/apikey.pub |
Allow | /?rss=y&most_recent=y |
Disallow | /wp-content/uploads/sites/*/ninja-forms/* |
Disallow | /?s=* |
Disallow | /templates/nbc_partner_player_amp?* |
Disallow | /results/* |
Disallow | /topics?* |
Disallow | /https* |
Disallow | /includes/*.js |
Disallow | /?cardId=* |
Disallow | /liveblog/*/* |
Disallow | /liveblog/*/?cardId=* |
Disallow | *customize_changeset_uuid%3D |
Disallow | *customize_autosaved%3D |
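
How the rules above interact can be sketched in a few lines. The example below is a minimal illustration, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and treating `*` as "any run of characters". The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and percent-encoding normalization is deliberately left out, so this is not a complete crawler implementation.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/.well-known/amphtml/apikey.pub"),
    ("allow", "/?rss=y&most_recent=y"),
    ("disallow", "/wp-content/uploads/sites/*/ninja-forms/*"),
    ("disallow", "/?s=*"),
    ("disallow", "/templates/nbc_partner_player_amp?*"),
    ("disallow", "/results/*"),
    ("disallow", "/topics?*"),
    ("disallow", "/https*"),
    ("disallow", "/includes/*.js"),
    ("disallow", "/?cardId=*"),
    ("disallow", "/liveblog/*/*"),
    ("disallow", "/liveblog/*/?cardId=*"),
    ("disallow", "*customize_changeset_uuid%3D"),
    ("disallow", "*customize_autosaved%3D"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


print(is_allowed("/wp-admin/admin-ajax.php"))  # True: the longer Allow beats /wp-admin/
print(is_allowed("/wp-admin/options.php"))     # False
print(is_allowed("/?s=weather"))               # False
print(is_allowed("/news/local/"))              # True: no rule matches
```

Pattern length stands in for specificity here, which is why `/wp-admin/admin-ajax.php` stays crawlable even though it sits under the disallowed `/wp-admin/` prefix.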
Other Records
Field | Value |
---|---|
sitemap | https://www.nbcnewyork.com/sitemap.xml |
sitemap | https://www.nbcnewyork.com/sitemap.xml?type=video |
sitemap | https://www.nbcnewyork.com/sitemap.xml?type=category |
sitemap | https://www.nbcnewyork.com/sitemap-news.xml |
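
Sitemap records sit outside any user-agent group, so a client can read them independently of the crawl rules. The sketch below uses Python's standard `urllib.robotparser` (`site_maps()` requires Python 3.8 or later). The `ROBOTS_TXT` string is a short excerpt reconstructed from the tables in this report, not the actual file served by the site, and `list_sitemaps` is a hypothetical helper name.

```python
from typing import List
from urllib.robotparser import RobotFileParser

# Excerpt rebuilt from this report for illustration; not the real file contents.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://www.nbcnewyork.com/sitemap.xml
Sitemap: https://www.nbcnewyork.com/sitemap.xml?type=video
Sitemap: https://www.nbcnewyork.com/sitemap.xml?type=category
Sitemap: https://www.nbcnewyork.com/sitemap-news.xml
"""


def list_sitemaps(robots_txt: str) -> List[str]:
    """Return every Sitemap URL declared in a robots.txt document."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap records are present.
    return parser.site_maps() or []


for url in list_sitemaps(ROBOTS_TXT):
    print(url)
```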
Warnings
- 21 invalid lines.