clustdoc.com
robots.txt

Robots Exclusion Standard data for clustdoc.com

Resource Scan

Scan Details

Site Domain clustdoc.com
Base Domain clustdoc.com
Scan Status Ok
Last Scan2025-12-28T14:32:21+00:00
Next Scan 2026-01-27T14:32:21+00:00

Last Scan

Scanned2025-12-28T14:32:21+00:00
URL https://clustdoc.com/robots.txt
Domain IPs 104.26.6.123, 104.26.7.123, 172.67.74.134, 2606:4700:20::681a:67b, 2606:4700:20::681a:77b, 2606:4700:20::ac43:4a86
Response IP 172.67.74.134
Found Yes
Hash 143324338f1efe61f7c86937cfdfc47d03899826850a92aa598ab003ac747544
SimHash 25b0b97397f0

Groups

*

Rule Path
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin
Disallow /?
Disallow *?wg*=
Disallow /wp-
Disallow /wp/
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed
Disallow */rss
Disallow */embed
Disallow */wlwmanifest.xml
Disallow /xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://clustdoc.com/sitemap.xml

Warnings

  • `host` is not a known field.