gdz.cloud
robots.txt

Robots Exclusion Standard data for gdz.cloud

Resource Scan

Scan Details

Site Domain gdz.cloud
Base Domain gdz.cloud
Scan Status Ok
Last Scan2024-11-13T01:32:19+00:00
Next Scan 2024-11-20T01:32:19+00:00

Last Scan

Scanned2024-11-13T01:32:19+00:00
URL https://gdz.cloud/robots.txt
Domain IPs 104.21.23.156, 172.67.211.179, 2606:4700:3035::ac43:d3b3, 2606:4700:3037::6815:179c
Response IP 104.21.23.156
Found Yes
Hash 5b73dd88bd96c09f8d8b35fc92d214fda8c15e78b67f579481ed8928727e2f64
SimHash 6145d658c7d3

Groups

*

Rule Path
Disallow /admin
Disallow /search
Disallow /404
Disallow /copyright-notice
Disallow /tag
Disallow /*page
Disallow /*trackback
Disallow /*feed
Disallow /*comments

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gdz.cloud/sitemap/sitemap.xml

Warnings

  • `host` is not a known field.