projects.newsday.com
robots.txt

Robots Exclusion Standard data for projects.newsday.com

Resource Scan

Scan Details

Site Domain projects.newsday.com
Base Domain newsday.com
Scan Status Ok
Last Scan 2024-05-03T09:57:50+00:00
Next Scan 2024-06-02T09:57:50+00:00

Last Scan

Scanned 2024-05-03T09:57:50+00:00
URL https://projects.newsday.com/robots.txt
Domain IPs 2600:9000:2024:1600:15:2476:6480:93a1, 2600:9000:2024:2400:15:2476:6480:93a1, 2600:9000:2024:3400:15:2476:6480:93a1, 2600:9000:2024:5a00:15:2476:6480:93a1, 2600:9000:2024:8200:15:2476:6480:93a1, 2600:9000:2024:b000:15:2476:6480:93a1, 2600:9000:2024:de00:15:2476:6480:93a1, 2600:9000:2024:e200:15:2476:6480:93a1, 65.9.112.112, 65.9.112.115, 65.9.112.15, 65.9.112.3
Response IP 18.165.171.31
Found Yes
Hash e5153e78f2bbf05f04e9d616927e96e115a31ef02438b824e3ad4b267ee6b820
SimHash 6b1c9e936f75

Groups

*

Rule Path
Allow /
Disallow /wp-includes
Disallow /wp-admin
Disallow /tyrion
Disallow /_common
Disallow /common
Disallow /services
Allow /services/podcast
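
The group above mixes a blanket Allow with narrower Disallow rules and one Allow exception under /services. A minimal sketch of how a crawler might evaluate these rules follows, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical and not part of the scan data. Because none of the listed paths uses wildcards, plain prefix matching is enough here.

```python
# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-admin"),
    ("disallow", "/tyrion"),
    ("disallow", "/_common"),
    ("disallow", "/common"),
    ("disallow", "/services"),
    ("allow", "/services/podcast"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: rules contain no wildcards, so a rule matches when the
    request path starts with the rule path.
    """
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # The longer rule path wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


for sample in ("/schools/", "/services/api", "/services/podcast/feed", "/wp-admin/"):
    print(sample, is_allowed(sample))
```

Under this reading, /services/podcast stays crawlable even though /services is disallowed, because its rule path is longer. The matching is a plain prefix test, so a path such as /commonwealth would also fall under Disallow /common. The sketch is written by hand rather than with Python's urllib.robotparser, which applies rules in file order rather than by longest match.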

Other Records

Field Value
sitemap https://projects.newsday.com/sitemap_index.xml
sitemap https://projects.newsday.com/voters-guide/sitemap.xml
sitemap https://projects.newsday.com/schools/sitemap.xml
sitemap https://projects.newsday.com/payrolls/sitemap_index.xml
sitemap https://projects.newsday.com/services/podcast/sitemap
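
The sitemap records above are independent of any user-agent group. As a rough sketch, they can be read back with Python's standard urllib.robotparser (site_maps() requires Python 3.8 or later). ROBOTS_LINES is a hypothetical reconstruction built from the records listed in this report, not the verbatim file, and list_sitemaps is an illustrative helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the report above; the real file's
# exact wording, ordering, and comments are not shown in the scan.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "Disallow: /services",
    "Allow: /services/podcast",
    "Sitemap: https://projects.newsday.com/sitemap_index.xml",
    "Sitemap: https://projects.newsday.com/voters-guide/sitemap.xml",
    "Sitemap: https://projects.newsday.com/schools/sitemap.xml",
    "Sitemap: https://projects.newsday.com/payrolls/sitemap_index.xml",
    "Sitemap: https://projects.newsday.com/services/podcast/sitemap",
]


def list_sitemaps(lines):
    """Return the Sitemap URLs declared in robots.txt lines, or [] if none."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []


sitemaps = list_sitemaps(ROBOTS_LINES)
for url in sitemaps:
    print(url)
```

To check against the live file instead, the same parser could be pointed at the URL given under Last Scan with set_url() and read(), which performs a network request.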