govtrack.us
robots.txt
Robots Exclusion Standard data for govtrack.us
Resource Scan
Scan Details
Site Domain | govtrack.us |
Base Domain | govtrack.us |
Scan Status | Ok |
Last Scan | 2024-10-06T21:32:09+00:00 |
Next Scan | 2024-10-13T21:32:09+00:00 |
Last Scan
Scanned | 2024-10-06T21:32:09+00:00 |
URL | https://govtrack.us/robots.txt |
Redirect | https://www.govtrack.us/robots.txt |
Redirect Domain | www.govtrack.us |
Redirect Base | govtrack.us |
Domain IPs | 72.249.66.95 |
Redirect IPs | 72.249.66.95 |
Response IP | 72.249.66.95 |
Found | Yes |
Hash | d578ff440c09fa306864ab76580d131a59b8f629d937eee159efb437b00b0d0b |
SimHash | 7b0e5151c034 |
Groups
googlebot
Rule | Path |
---|---|
Disallow | /data |
Disallow | /registration/ext |
Disallow | /accounts |
Disallow | /api |
Disallow | */xml |
Disallow | */details |
Disallow | */widget |
Disallow | */_text_image |
Disallow | *?* |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
mediapartners-google
Rule | Path |
---|---|
Disallow | /data |
Disallow | /registration/ext |
Disallow | /accounts |
Disallow | /api |
Disallow | */xml |
Disallow | */details |
Disallow | */widget |
Disallow | */_text_image |
Disallow | *?* |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
slurp
Rule | Path |
---|---|
Disallow | /data |
Disallow | /registration/ext |
Disallow | /accounts |
Disallow | /api |
Disallow | */xml |
Disallow | */details |
Disallow | */widget |
Disallow | */_text_image |
Disallow | *?* |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
bingbot
Rule | Path |
---|---|
Disallow | /data |
Disallow | /registration/ext |
Disallow | /accounts |
Disallow | /api |
Disallow | */xml |
Disallow | */details |
Disallow | */widget |
Disallow | */_text_image |
Disallow | *?* |
Other Records
Field | Value |
---|---|
crawl-delay | 7 |
*
Rule | Path |
---|---|
Disallow | /data |
Disallow | /registration/ext |
Disallow | /accounts |
Disallow | /api |
Disallow | */xml |
Disallow | */details |
Disallow | */widget |
Disallow | */_text_image |
Disallow | *?* |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.govtrack.us/sitemap.xml |
Comments