jccmanhattan.org
robots.txt

Robots Exclusion Standard data for jccmanhattan.org

Resource Scan

Scan Details

Site Domain jccmanhattan.org
Base Domain jccmanhattan.org
Scan Status Ok
Last Scan2026-01-03T01:50:36+00:00
Next Scan 2026-02-02T01:50:36+00:00

Last Scan

Scanned2026-01-03T01:50:36+00:00
URL https://jccmanhattan.org/robots.txt
Redirect https://www.mmjccm.org/robots.txt
Redirect Domain www.mmjccm.org
Redirect Base mmjccm.org
Domain IPs 67.205.28.74
Redirect IPs 66.33.60.67, 76.76.21.98
Response IP 66.33.60.66
Found Yes
Hash 522d7e984620617f7778e68f263b0744c0644853954e01453d17fdb273b203d3
SimHash 6d044d15ce92

Groups

*

Rule Path
Disallow /api/
Disallow /admin/
Disallow /register/
Disallow /internal/
Allow /

badbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mmjccm.org/sitemap.xml

Comments

  • Prevent specific crawlers from indexing your site
  • Sitemap location