www.nyscr.ny.gov
robots.txt

Robots Exclusion Standard data for www.nyscr.ny.gov

Resource Scan

Scan Details

Site Domain www.nyscr.ny.gov
Base Domain ny.gov
Scan Status Ok
Last Scan2024-08-28T18:54:41+00:00
Next Scan 2024-09-27T18:54:41+00:00

Last Scan

Scanned2024-08-28T18:54:41+00:00
URL https://www.nyscr.ny.gov/robots.txt
Domain IPs 38.74.66.26
Response IP 38.74.66.26
Found Yes
Hash 478f04075b3df5d70bd06d5863f7fa56b32553fbc84d57ed22c3407e3dbc3f9c
SimHash 884bfc00331a

Groups

*

Rule Path
Disallow /agency
Disallow /business
Disallow /css
Disallow /daily
Disallow /documents
Disallow /error
Disallow /feeds
Disallow /fonts
Disallow /images
Disallow /includes
Disallow /js
Disallow /pre-registration
Disallow /ScheduledTasks
Disallow /systemadmin
Disallow /Application.cfc
Disallow /adsArchive.cfm
Disallow /bulletinsShare.cfm
Disallow /bulletinsSharex.cfm
Disallow /bulletinsView.cfm
Disallow /bulletinsViewPDF.cfm
Disallow /eventsShare.cfm
Disallow /eventsSharex.cfm
Disallow /eventsView.cfm
Disallow /eventsViewPDF.cfm
Disallow /newAgency1.cfm
Disallow /newAgency2.cfm
Disallow /newAgency3.cfm
Disallow /newAgency4.cfm
Disallow /register1.cfm
Disallow /register2.cfm
Disallow /register3.cfm
Disallow /reports.cfm
Disallow /web.config
Disallow /policies_coming_soon.cfm