commons.gc.cuny.edu
robots.txt

Robots Exclusion Standard data for commons.gc.cuny.edu

Resource Scan

Scan Details

Site Domain commons.gc.cuny.edu
Base Domain cuny.edu
Scan Status Ok
Last Scan2024-09-25T22:27:06+00:00
Next Scan 2024-10-25T22:27:06+00:00

Last Scan

Scanned2024-09-25T22:27:06+00:00
URL https://commons.gc.cuny.edu/robots.txt
Domain IPs 146.96.128.200
Response IP 146.96.128.200
Found Yes
Hash 2a26801904d949350443620b1be0a81cbba1633a78daa726620a61431780259b
SimHash 405c5062ea83

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/*.php
Disallow /wp-content/cache
Disallow /trackback
Disallow /comments
Disallow /wiki/

Other Records

Field Value
crawl-delay 100

ahrefsbot
claudebot
dataforseobot
dotbot
gptbot
seekportbot
semrushbot
mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://commons.gc.cuny.edu/wp-sitemap.xml