git.caraus.tech
robots.txt

Robots Exclusion Standard data for git.caraus.tech

Resource Scan

Scan Details

Site Domain git.caraus.tech
Base Domain caraus.tech
Scan Status Ok
Last Scan2025-12-10T14:32:30+00:00
Next Scan 2025-12-24T14:32:30+00:00

Last Scan

Scanned2025-12-10T14:32:30+00:00
URL https://git.caraus.tech/robots.txt
Domain IPs 159.69.50.180, 2a01:4f8:c013:3061::
Response IP 159.69.50.180
Found Yes
Hash 3361d99441156d1a5415ca9dc5904c2a1e7ee96cc4848f9eb5bb105736621ecb
SimHash 4105d8588bd0

Groups

*

Rule Path
Disallow /*/*/issues/new
Disallow /*/*/projects
Disallow /*/*/actions
Disallow /*/*/actions/
Disallow /*/*/activity
Disallow /*/*/branches
Disallow /*/*/tags
Disallow /*/*/src/commit/
Disallow /*/*/blame/commit/
Disallow /*/*/raw/commit/
Disallow /*/*/compare/
Disallow /*/*/graph
Disallow /*/*/settings
Disallow /*/*/settings/
Disallow /*/*/watchers
Disallow /*/*/stars
Disallow /*/*/forks
Disallow /*/*.rss
Disallow /.git/
Disallow */.git/
Disallow /*.git$
Disallow /api
Disallow /-/admin
Disallow /user/login

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://git.caraus.tech/sitemap.xml