Robots Exclusion Standard data for governmentcurated.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | governmentcurated.com |
Base Domain | governmentcurated.com |
Scan Status | Ok |
Last Scan | 2025-05-17T22:43:59+00:00 |
Next Scan | 2025-05-24T22:43:59+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-05-17T22:43:59+00:00 |
URL | https://governmentcurated.com/robots.txt |
Domain IPs | 104.21.50.65, 172.67.157.142, 2606:4700:3037::6815:3241, 2606:4700:3037::ac43:9d8e |
Response IP | 172.67.157.142 |
Found | Yes |
Hash | 9599a9eab7f1fed2c67d299c0e4b095d42c81cdf2e0496d693cd768953ec98f6 |
SimHash | 69048310cb96 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /wp-* |
Disallow | /*?* |
Allow | */uploads |
Allow | /wp-*/*.css |
Allow | /wp-*/*.js |
Allow | /wp-*/*.jpg |
Allow | /wp-*/*.mp4 |
Allow | /wp-*/*.woff |
Allow | /wp-*/*.ttf |
Allow | /wp-*/*.png |
Allow | /wp-*/*.svg |
Allow | /wp-*/*.webp |
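
How the Disallow and Allow rules in this group interact can be sketched in Python. This is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). Individual crawlers may behave differently, particularly for the `*/uploads` pattern, which does not begin with `/`. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-*"),
    ("disallow", "/*?*"),
    ("allow", "*/uploads"),
    ("allow", "/wp-*/*.css"),
    ("allow", "/wp-*/*.js"),
    ("allow", "/wp-*/*.jpg"),
    ("allow", "/wp-*/*.mp4"),
    ("allow", "/wp-*/*.woff"),
    ("allow", "/wp-*/*.ttf"),
    ("allow", "/wp-*/*.png"),
    ("allow", "/wp-*/*.svg"),
    ("allow", "/wp-*/*.webp"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence.

    Assumption: the longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path (prefix match).
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/wp-admin/")` returns `False` because only `/wp-*` matches, while `is_allowed("/wp-content/uploads/report.pdf")` returns `True` because `*/uploads` is a longer pattern than `/wp-*`. Any URL with a query string that matches no Allow rule, such as `/?s=test`, is blocked by `/*?*`.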
Other Records
Field | Value |
---|---|
sitemap | https://governmentcurated.com/sitemap_index.xml |