github.com
robots.txt

Robots Exclusion Standard data for github.com

Resource Scan

Scan Details

Site Domain github.com
Base Domain github.com
Scan Status Ok
Last Scan 2024-10-29T09:08:17+00:00
Next Scan 2024-11-12T09:08:17+00:00

Last Scan

Scanned 2024-10-29T09:08:17+00:00
URL https://github.com/robots.txt
Domain IPs 20.205.243.166
Response IP 20.205.243.166
Found Yes
Hash 7c4c8923a7a422357674dc136a76dd863bfb9c1cec252ce5812005ec5878663b
SimHash 6e0640b3bfc5

Groups

baidu

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
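
As a rough illustration of how a client might read the crawl-delay record above, the sketch below feeds a small robots.txt fragment to Python's standard urllib.robotparser. The fragment is reconstructed from this scan (a baidu group carrying only crawl-delay, and one rule from the * group); the exact layout of the real file and the agent name "examplebot" are assumptions.

```python
from urllib.robotparser import RobotFileParser

# Assumed fragment, reconstructed from the scan report rather than
# copied from the live file.
ROBOTS_TXT = """\
User-agent: baidu
crawl-delay: 1

User-agent: *
Disallow: /gist/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# The baidu group has a delay but no Allow/Disallow rules,
# so every path is allowed for that agent.
baidu_delay = parser.crawl_delay("baidu")
baidu_can_fetch = parser.can_fetch("baidu", "/gist/example")

# Any other agent (hypothetical name) falls through to the * group,
# which defines no crawl-delay.
other_delay = parser.crawl_delay("examplebot")
other_can_fetch = parser.can_fetch("examplebot", "/gist/example")

print(baidu_delay, baidu_can_fetch, other_delay, other_can_fetch)
```

Note that urllib.robotparser matches rules by plain path prefix, so it is only suitable here because the fragment contains no wildcard patterns.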

*

Rule Path
Disallow /*/*/pulse
Disallow /*/*/projects
Disallow /*/*/forks
Disallow /*/*/issues/new
Disallow /*/*/issues/search
Disallow /*/*/commits/
Disallow /*/*/branches
Disallow /*/*/contributors
Disallow /*/*/tags
Disallow /*/*/stargazers
Disallow /*/*/watchers
Disallow /*/*/network
Disallow /*/*/graphs
Disallow /*/*/compare
Disallow /*/tree/
Disallow /gist/
Disallow /*/download
Disallow /*/revisions
Disallow /*/commits/*?author
Disallow /*/commits/*?path
Disallow /*/comments
Disallow /*/archive/
Disallow /*/blame/
Disallow /*/raw/
Disallow /*/cache/
Disallow /.git/
Disallow */.git/
Disallow /*.git$
Disallow /search/advanced
Disallow /search$
Disallow /*q%3D
Disallow /*.atom$
Disallow /ekansa/Open-Context-Data
Disallow /ekansa/opencontext-*
Disallow */tarball/
Disallow */zipball/
Disallow /*source%3D*
Disallow /*ref_cta%3D*
Disallow /*plan%3D*
Disallow /*return_to%3D*
Disallow /*ref_loc%3D*
Disallow /*setup_organization%3D*
Disallow /*source_repo%3D*
Disallow /*ref_page%3D*
Disallow /*source%3D*
Disallow /*referrer%3D*
Disallow /*report%3D*
Disallow /*author%3D*
Disallow /*since%3D*
Disallow /*until%3D*
Disallow /*commits?author=*
Disallow /*report-abuse?report=*
Disallow /*tab%3D*
Allow /*?tab=achievements&achievement=*
Disallow /account-login
Disallow /Explodingstuff/
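
Many of the rules above rely on the * wildcard and the $ end anchor. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention that * matches any run of characters, a trailing $ anchors the end of the URL, the longest matching pattern wins, and Allow wins ties. The helper names (pattern_to_regex, is_allowed) and the sample paths are hypothetical, and the sketch does not normalize percent-encoding as a production matcher would.

```python
import re

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the match at the end of the URL.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules):
    """Return True if path may be crawled under the given (kind, pattern) rules."""
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind.lower() == "allow"
            # Longest pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed

# A subset of the rules listed for the * group.
RULES = [
    ("Disallow", "/*/*/pulse"),
    ("Disallow", "/*/tree/"),
    ("Disallow", "/search$"),
    ("Disallow", "/*.git$"),
    ("Disallow", "/*tab%3D*"),
    ("Allow", "/*?tab=achievements&achievement=*"),
]

for sample in ["/octocat/hello-world/pulse", "/search", "/search/repositories",
               "/octocat/hello-world.git", "/octocat/hello-world/tree/main"]:
    print(sample, is_allowed(sample, RULES))
```

With these rules, /search is blocked by the anchored pattern /search$, while a longer path such as /search/repositories is not matched by it.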

Comments

  • If you would like to crawl GitHub contact us via https://support.github.com?tags=dotcom-robots
  • We also provide an extensive API: https://docs.github.com