commons.gc.cuny.edu
robots.txt
Robots Exclusion Standard data for commons.gc.cuny.edu
Resource Scan
Scan Details
Site Domain | commons.gc.cuny.edu |
Base Domain | cuny.edu |
Scan Status | Ok |
Last Scan | 2024-09-25T22:27:06+00:00 |
Next Scan | 2024-10-25T22:27:06+00:00 |
Last Scan
Scanned | 2024-09-25T22:27:06+00:00 |
URL | https://commons.gc.cuny.edu/robots.txt |
Domain IPs | 146.96.128.200 |
Response IP | 146.96.128.200 |
Found | Yes |
Hash | 2a26801904d949350443620b1be0a81cbba1633a78daa726620a61431780259b |
SimHash | 405c5062ea83 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/*.php |
Disallow | /wp-content/cache |
Disallow | /trackback |
Disallow | /comments |
Disallow | /wiki/ |
Other Records
Field | Value |
---|---|
crawl-delay | 100 |
Other Records
Field | Value |
---|---|
sitemap | https://commons.gc.cuny.edu/wp-sitemap.xml |