us.newyorktimesnow.com
robots.txt

Robots Exclusion Standard data for us.newyorktimesnow.com

Resource Scan

Scan Details

Site Domain us.newyorktimesnow.com
Base Domain newyorktimesnow.com
Scan Status Ok
Last Scan2024-08-31T20:27:26+00:00
Next Scan 2024-09-30T20:27:26+00:00

Last Scan

Scanned2024-08-31T20:27:26+00:00
URL https://us.newyorktimesnow.com/robots.txt
Domain IPs 104.21.36.135, 172.67.194.212, 2606:4700:3036::ac43:c2d4, 2606:4700:3037::6815:2487
Response IP 172.67.194.212
Found Yes
Hash 19e7ae8bea276087a17f49ae03a5afc5140111067b543ab15cbb8a41a7ee86d0
SimHash 622de44c43f2

Groups

*

Rule Path
Disallow /assets
Disallow /cache
Disallow /sources
Disallow /api
Disallow /script_backups
Disallow /updates
Disallow /install
Disallow /admincp
Disallow /admin-panel
Disallow /ajax_loading.php
Disallow /api.php
Disallow /xml
Disallow /system_status.php
Disallow /nodejs