govtrack.us
robots.txt

Robots Exclusion Standard data for govtrack.us

Resource Scan

Scan Details

Site Domain govtrack.us
Base Domain govtrack.us
Scan Status Ok
Last Scan2024-10-06T21:32:09+00:00
Next Scan 2024-10-13T21:32:09+00:00

Last Scan

Scanned2024-10-06T21:32:09+00:00
URL https://govtrack.us/robots.txt
Redirect https://www.govtrack.us/robots.txt
Redirect Domain www.govtrack.us
Redirect Base govtrack.us
Domain IPs 72.249.66.95
Redirect IPs 72.249.66.95
Response IP 72.249.66.95
Found Yes
Hash d578ff440c09fa306864ab76580d131a59b8f629d937eee159efb437b00b0d0b
SimHash 7b0e5151c034

Groups

googlebot

Rule Path
Disallow /data
Disallow /registration/ext
Disallow /accounts
Disallow /api
Disallow */xml
Disallow */details
Disallow */widget
Disallow */_text_image
Disallow *?*

Other Records

Field Value
crawl-delay 3

mediapartners-google

Rule Path
Disallow /data
Disallow /registration/ext
Disallow /accounts
Disallow /api
Disallow */xml
Disallow */details
Disallow */widget
Disallow */_text_image
Disallow *?*

Other Records

Field Value
crawl-delay 3

slurp

Rule Path
Disallow /data
Disallow /registration/ext
Disallow /accounts
Disallow /api
Disallow */xml
Disallow */details
Disallow */widget
Disallow */_text_image
Disallow *?*

Other Records

Field Value
crawl-delay 5

bingbot

Rule Path
Disallow /data
Disallow /registration/ext
Disallow /accounts
Disallow /api
Disallow */xml
Disallow */details
Disallow */widget
Disallow */_text_image
Disallow *?*

Other Records

Field Value
crawl-delay 7

*

Rule Path
Disallow /data
Disallow /registration/ext
Disallow /accounts
Disallow /api
Disallow */xml
Disallow */details
Disallow */widget
Disallow */_text_image
Disallow *?*

Other Records

Field Value
crawl-delay 30

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.govtrack.us/sitemap.xml

Comments

  • Google
  • Yahoo
  • Bing
  • Everyone Else
  • https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt
  • Sitemap