nationaljournal.com
robots.txt

Robots Exclusion Standard data for nationaljournal.com

Resource Scan

Scan Details

Site Domain nationaljournal.com
Base Domain nationaljournal.com
Scan Status Ok
Last Scan2024-09-01T08:03:20+00:00
Next Scan 2024-10-01T08:03:20+00:00

Last Scan

Scanned2024-09-01T08:03:20+00:00
URL https://nationaljournal.com/robots.txt
Redirect https://www.nationaljournal.com/robots.txt
Redirect Domain www.nationaljournal.com
Redirect Base nationaljournal.com
Domain IPs 13.107.246.59
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash dc279e7d15e79cba821169271df1458bad75befcbd2be8d7abc3b5798642ccb6
SimHash bbf0d9f0fb23

Groups

*

Rule Path
Disallow /almanac
Disallow /best-practices
Disallow /dashboard
Disallow /domesticpolicy
Disallow /events/membership
Disallow /internal
Disallow /login
Disallow /media-university
Disallow /my-account
Disallow /peer-briefings
Disallow /presentations
Disallow /search
Disallow /support
Disallow /whitehouse
Disallow *?rss=1
Disallow *?rss=full
Disallow /vignette/profile/*

Other Records

Field Value
crawl-delay 5

blp_bbot/0.1

Rule Path
Disallow /

magpie-crawler*

Rule Path
Disallow /