nextgov.com
robots.txt

Robots Exclusion Standard data for nextgov.com

Resource Scan

Scan Details

Site Domain nextgov.com
Base Domain nextgov.com
Scan Status Ok
Last Scan 2024-09-28T07:32:28+00:00
Next Scan 2024-10-05T07:32:28+00:00

Last Scan

Scanned 2024-09-28T07:32:28+00:00
URL https://nextgov.com/robots.txt
Redirect https://www.nextgov.com/robots.txt
Redirect Domain www.nextgov.com
Redirect Base nextgov.com
Domain IPs 20.119.16.4
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash dbcc7ac143523c66787a0885eee4bbb157316364f2e4ebb6b7e01b4583933551
SimHash ab58d8eecbf3

Groups

*

Rule Path
Disallow /mailbag/
Disallow /newsletters/issue/
Disallow /617/
Disallow /session/
Disallow /html/

Other Records

Field Value
crawl-delay 1

magpie-crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
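
Example

The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT text is a reconstruction from the rules listed in this report, not the file as served, so its field order and capitalization are assumptions. The user agent "ExampleBot", the helper build_parser, and the path /cybersecurity/ are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (assumption: the
# served file may differ in ordering, case, comments, or extra fields).
ROBOTS_TXT = """\
User-agent: *
Disallow: /mailbag/
Disallow: /newsletters/issue/
Disallow: /617/
Disallow: /session/
Disallow: /html/
Crawl-delay: 1

User-agent: magpie-crawler
Disallow: /

User-agent: gptbot
Disallow: /
"""

BASE = "https://www.nextgov.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text in memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler that falls under the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/mailbag/"))        # listed path
print(parser.can_fetch("ExampleBot", BASE + "/cybersecurity/"))  # hypothetical unlisted path
print(parser.can_fetch("gptbot", BASE + "/"))                    # fully disallowed group
print(parser.can_fetch("magpie-crawler", BASE + "/"))            # fully disallowed group
print(parser.crawl_delay("ExampleBot"))                          # delay from the "*" group
```

To evaluate the live file instead of the reconstruction, RobotFileParser also offers set_url() and read(), which fetch the file over the network.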