nationalpublicdata.com
robots.txt

Robots Exclusion Standard data for nationalpublicdata.com

Resource Scan

Scan Details

Site Domain nationalpublicdata.com
Base Domain nationalpublicdata.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-12-05T15:06:49+00:00
Next Scan 2026-02-03T15:06:49+00:00

Last Successful Scan

Scanned 2025-09-13T01:19:50+00:00
URL https://nationalpublicdata.com/robots.txt
Domain IPs 104.26.0.211, 104.26.1.211, 172.67.71.179, 2606:4700:20::681a:1d3, 2606:4700:20::681a:d3, 2606:4700:20::ac43:47b3
Response IP 172.67.71.179
Found Yes
Hash 7b4e65e6556a73b197eb49f59f072bce025de1e736bb648c983a8394cf3b6979
SimHash 990019048391

Groups

*

Rule Path
Disallow *?s=
Disallow */feed/
Disallow */pd
Disallow /cgi-bin
Disallow /cdn-cgi
Disallow /optout/*
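
To illustrate how the Disallow patterns above might apply to request paths, here is a minimal Python sketch. It assumes the common wildcard interpretation, in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and everything else is matched as a literal prefix. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and are not part of the scan record. The sketch covers Disallow rules only, since this group lists no Allow rules, and it is not a full robots.txt parser.

```python
import re

# Disallow patterns copied from the "*" group in the scan record.
DISALLOW = ["*?s=", "*/feed/", "*/pd", "/cgi-bin", "/cdn-cgi", "/optout/*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and rejoin them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the path from its start.

    The path is expected to include the query string, e.g. '/?s=term'.
    """
    # re.match anchors at the beginning, which gives prefix matching.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/?s=term", "/blog/feed/", "/optout/form", "/optout", "/about"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/optout/form` is blocked by `/optout/*` while the bare path `/optout` is not, because the pattern requires the trailing slash.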

Other Records

Field Value
sitemap https://nationalpublicdata.com/sitemap.xml