gitguardian.com
robots.txt
Robots Exclusion Standard data for gitguardian.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | gitguardian.com |
Base Domain | gitguardian.com |
Scan Status | Ok |
Last Scan | 2024-09-21T08:45:28+00:00 |
Next Scan | 2024-10-05T08:45:28+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-21T08:45:28+00:00 |
URL | https://gitguardian.com/robots.txt |
Redirect | https://www.gitguardian.com/robots.txt |
Redirect Domain | www.gitguardian.com |
Redirect Base | gitguardian.com |
Domain IPs | 13.248.155.104, 76.223.27.102 |
Redirect IPs | 52.197.0.54, 52.199.221.217, 54.178.223.218 |
Response IP | 52.197.0.54 |
Found | Yes |
Hash | a538c959b41ab94d8c54d675cc6df09117677437ebce8431fce25f13162595c0 |
SimHash | eb394410eb10 |
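The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The scan record does not name the algorithm or say exactly which bytes were hashed, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw response body. The function names `sha256_hex` and `matches_recorded_hash` are hypothetical helpers, not part of any scanner API.

```python
import hashlib

# Hash value as recorded in the scan details above.
RECORDED_HASH = "a538c959b41ab94d8c54d675cc6df09117677437ebce8431fce25f13162595c0"


def sha256_hex(body: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of a response body."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a body's digest with a recorded hash.

    Assumption: the recorded hash is SHA-256 of the unmodified body bytes.
    """
    return sha256_hex(body) == recorded.strip().lower()
```

A caller would pass the bytes fetched from the Redirect URL. A mismatch could mean that the file has changed since the last scan or that the assumption about the algorithm is wrong.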
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /work-in-progress/ |
Disallow | /terms/ |
Disallow | /files/code-of-conduct |
Disallow | /files/self-hosted-end-user-licence-agreement |
Disallow | /files/end-user-license-agreement-saas |
Disallow | *?* |
Allow | / |
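To see how a crawler might evaluate the rules in this group, here is a minimal sketch that assumes RFC 9309 style matching: the longest matching pattern wins, Allow wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. Real crawlers may differ in details, and `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names chosen for this illustration.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/work-in-progress/"),
    ("disallow", "/terms/"),
    ("disallow", "/files/code-of-conduct"),
    ("disallow", "/files/self-hosted-end-user-licence-agreement"),
    ("disallow", "/files/end-user-license-agreement-saas"),
    ("disallow", "*?*"),
    ("allow", "/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path (including any query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


print(is_allowed("/blog"))           # True: only "Allow: /" matches
print(is_allowed("/terms/privacy"))  # False: "/terms/" is longer than "/"
print(is_allowed("/blog?page=2"))    # False: "*?*" blocks query strings
print(is_allowed("/terms"))          # True: no trailing slash, so "/terms/" does not match
```

Under these assumptions, the `*?*` rule blocks every URL that carries a query string, while the trailing `Allow: /` only restates the default for everything else.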
Other Records
Field | Value |
---|---|
sitemap | https://www.gitguardian.com/sitemap.xml |
sitemap | https://www.gitguardian.com/sitemap.xml |