nyscr.ny.gov
robots.txt

Robots Exclusion Standard data for nyscr.ny.gov

Resource Scan

Scan Details

Site Domain nyscr.ny.gov
Base Domain ny.gov
Scan Status Ok
Last Scan2024-05-30T19:55:12+00:00
Next Scan 2024-06-29T19:55:12+00:00

Last Scan

Scanned2024-05-30T19:55:12+00:00
URL https://nyscr.ny.gov/robots.txt
Redirect https://www.nyscr.ny.gov/robots.txt
Redirect Domain www.nyscr.ny.gov
Redirect Base ny.gov
Domain IPs 38.74.66.26
Redirect IPs 38.74.66.26
Response IP 38.74.66.26
Found Yes
Hash 478f04075b3df5d70bd06d5863f7fa56b32553fbc84d57ed22c3407e3dbc3f9c
SimHash 884bfc00331a

Groups

*

Rule Path
Disallow /agency
Disallow /business
Disallow /css
Disallow /daily
Disallow /documents
Disallow /error
Disallow /feeds
Disallow /fonts
Disallow /images
Disallow /includes
Disallow /js
Disallow /pre-registration
Disallow /ScheduledTasks
Disallow /systemadmin
Disallow /Application.cfc
Disallow /adsArchive.cfm
Disallow /bulletinsShare.cfm
Disallow /bulletinsSharex.cfm
Disallow /bulletinsView.cfm
Disallow /bulletinsViewPDF.cfm
Disallow /eventsShare.cfm
Disallow /eventsSharex.cfm
Disallow /eventsView.cfm
Disallow /eventsViewPDF.cfm
Disallow /newAgency1.cfm
Disallow /newAgency2.cfm
Disallow /newAgency3.cfm
Disallow /newAgency4.cfm
Disallow /register1.cfm
Disallow /register2.cfm
Disallow /register3.cfm
Disallow /reports.cfm
Disallow /web.config
Disallow /policies_coming_soon.cfm