gdz.cloud
robots.txt
Robots Exclusion Standard data for gdz.cloud
Resource Scan
Scan Details
Site Domain | gdz.cloud |
Base Domain | gdz.cloud |
Scan Status | Ok |
Last Scan | 2024-11-13T01:32:19+00:00 |
Next Scan | 2024-11-20T01:32:19+00:00 |
Last Scan
Scanned | 2024-11-13T01:32:19+00:00 |
URL | https://gdz.cloud/robots.txt |
Domain IPs | 104.21.23.156, 172.67.211.179, 2606:4700:3035::ac43:d3b3, 2606:4700:3037::6815:179c |
Response IP | 104.21.23.156 |
Found | Yes |
Hash | 5b73dd88bd96c09f8d8b35fc92d214fda8c15e78b67f579481ed8928727e2f64 |
SimHash | 6145d658c7d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /search |
Disallow | /404 |
Disallow | /copyright-notice |
Disallow | /tag |
Disallow | /*page |
Disallow | /*trackback |
Disallow | /*feed |
Disallow | /*comments |
Other Records
Field | Value |
---|---|
sitemap | https://gdz.cloud/sitemap/sitemap.xml |
Warnings
- `host` is not a known field.