/.well-known/

jccmanhattan.org
robots.txt

Robots Exclusion Standard data for jccmanhattan.org

Resource Scan

Scan Details

Site Domain	jccmanhattan.org
Base Domain	jccmanhattan.org
Scan Status	Ok
Last Scan	2026-01-03T01:50:36+00:00
Next Scan	2026-02-02T01:50:36+00:00

Last Scan

Scanned	2026-01-03T01:50:36+00:00
URL	https://jccmanhattan.org/robots.txt
Redirect	https://www.mmjccm.org/robots.txt
Redirect Domain	www.mmjccm.org
Redirect Base	mmjccm.org
Domain IPs	67.205.28.74
Redirect IPs	66.33.60.67, 76.76.21.98
Response IP	66.33.60.66
Found	Yes
Hash	522d7e984620617f7778e68f263b0744c0644853954e01453d17fdb273b203d3
SimHash	6d044d15ce92

Groups

User-agent: *

Rule	Path
Disallow	/api/
Disallow	/admin/
Disallow	/register/
Disallow	/internal/
Allow	/

User-agent: badbot

Rule	Path
Disallow	/
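
The two groups above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanner's own method: the robots.txt text is a hypothetical reconstruction from the listed rules (the real file's ordering, comments, and spacing are not shown in this report), and the crawler name "ExampleBot" and the path "/programs" are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the groups listed in this report.
# The real file may order or space its lines differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /api/
Disallow: /admin/
Disallow: /register/
Disallow: /internal/
Allow: /

User-agent: badbot
Disallow: /
"""

# Host the scan was redirected to, taken from the report.
BASE = "https://www.mmjccm.org"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" and "/programs" are illustrative names, not from the report.
    for agent, path in [
        ("ExampleBot", "/"),
        ("ExampleBot", "/api/users"),
        ("ExampleBot", "/programs"),
        ("badbot", "/programs"),
    ]:
        print(agent, path, allowed(agent, path))
```

Python's parser applies the first matching rule in file order, whereas RFC 9309 specifies longest-match precedence. For this rule set both approaches give the same answers, since every Disallow path is more specific than the closing Allow rule for /.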

Other Records

Field	Value
sitemap	https://www.mmjccm.org/sitemap.xml

Comments

Prevent specific crawlers from indexing your site
Sitemap location
