nyceac.org
robots.txt

Robots Exclusion Standard data for nyceac.org

Resource Scan

Scan Details

Site Domain nyceac.org
Base Domain nyceac.org
Scan Status Ok
Last Scan2025-11-28T18:05:25+00:00
Next Scan 2025-12-28T18:05:25+00:00

Last Scan

Scanned2025-11-28T18:05:25+00:00
URL https://nyceac.org/robots.txt
Domain IPs 104.21.65.10, 172.67.157.21, 2606:4700:3034::6815:410a, 2606:4700:3037::ac43:9d15
Response IP 172.67.157.21
Found Yes
Hash 64e4d3fe84f23340cb1bc03c651ff0c2ce33a3e48d9a419993f1fff3db4cf80d
SimHash 48155970c933

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nyceac.org/sitemap.xml