nextcloud.com
robots.txt

Robots Exclusion Standard data for nextcloud.com

Resource Scan

Scan Details

Site Domain nextcloud.com
Base Domain nextcloud.com
Scan Status Ok
Last Scan2025-03-20T13:29:53+00:00
Next Scan 2025-04-19T13:29:53+00:00

Last Scan

Scanned2025-03-20T13:29:53+00:00
URL https://nextcloud.com/robots.txt
Domain IPs 2a01:4f8:a0:3068::2, 85.10.195.17
Response IP 85.10.195.17
Found Yes
Hash e3c9da81e561391d623d2ce32e43554b494ef34cc21374d5b5f2de3291f8c426
SimHash 3d554f054931

Groups

*

Rule Path
Disallow /*.pdf$
Disallow /media/*.pdf

Comments

  • This section blocks all PDF files in the /media folder from being indexed