origin-www.nycgo.com
robots.txt
Robots Exclusion Standard data for origin-www.nycgo.com
Resource Scan
Scan Details
Site Domain | origin-www.nycgo.com |
Base Domain | nycgo.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-03-15T21:01:39+00:00 |
Next Scan | 2024-06-13T21:01:39+00:00 |
Last Successful Scan
Scanned | 2023-10-25T19:38:03+00:00 |
URL | https://origin-www.nycgo.com/robots.txt |
Domain IPs | 13.33.33.120, 13.33.33.27, 13.33.33.39, 13.33.33.87 |
Response IP | 13.33.33.39 |
Found | Yes |
Hash | 18c33a0b378d237592fb088972a04cdb6920092e0c261174da0c6d2df3fa12af |
SimHash | 910d5b524dd5 |
Groups
*
Rule | Path |
---|---|
Disallow | /browse/* |
Disallow | /search* |
Disallow | /category* |
Disallow | /assets* |
Disallow | /content* |
Disallow | /crawl* |
Disallow | /listicles* |
Disallow | /slideshows* |
Disallow | /venue/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.nycgo.com/sitemap.xml |