code.google.com
robots.txt
Robots Exclusion Standard data for code.google.com
Resource Scan
Scan Details
Site Domain | code.google.com |
Base Domain | google.com |
Scan Status | Ok |
Last Scan | 2024-04-30T17:51:15+00:00 |
Next Scan | 2024-05-30T17:51:15+00:00 |
Last Scan
Scanned | 2024-04-30T17:51:15+00:00 |
URL | https://code.google.com/robots.txt |
Domain IPs | 2404:6800:4003:c04::65, 2404:6800:4003:c04::66, 2404:6800:4003:c04::71, 2404:6800:4003:c04::8a, 64.233.170.100, 64.233.170.101, 64.233.170.102, 64.233.170.113, 64.233.170.138, 64.233.170.139 |
Response IP | 142.251.175.100 |
Found | Yes |
Hash | 052a0c9b47bc67736489319f07ef0e18defbd675121a1284ac03741eb0296526 |
SimHash | 6130c4520359 |
Groups
*
Rule | Path |
---|---|
Disallow | /hosting/search |
Disallow | /p/*/issues/csv |
Disallow | /p/*/source/diff |
Disallow | /p/*/people/detail |
Disallow | /u/* |
Disallow | /a/ |
Allow | /a/eclipselabs.org/ |
Allow | /a/apache-extras.org/ |
Disallow | /a/*/hosting/search |
Disallow | /a/*/p/*/issues/csv |
Disallow | /a/*/p/*/source/diff |
Disallow | /p/blackout-m7/ |
Other Records
Field | Value |
---|---|
crawl-delay | 120 |