documentsnap.com
robots.txt

Robots Exclusion Standard data for documentsnap.com

Resource Scan

Scan Details

Site Domain documentsnap.com
Base Domain documentsnap.com
Scan Status Ok
Last Scan 2025-11-01T10:17:39+00:00
Next Scan 2025-12-01T10:17:39+00:00

Last Scan

Scanned 2025-11-01T10:17:39+00:00
URL https://documentsnap.com/robots.txt
Domain IPs 194.1.147.48, 194.1.147.57
Response IP 194.1.147.48
Found Yes
Hash 311d505bfe95df8db8865249b02cfc83de0796ec00f1113fc940986f5ea2afbf
SimHash b9087c9cc0b2
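
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes are hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The helper name `body_digest` and the sample body are hypothetical.

```python
import hashlib

# Hash value as shown in the scan report above.
REPORTED_HASH = "311d505bfe95df8db8865249b02cfc83de0796ec00f1113fc940986f5ea2afbf"

def body_digest(body: bytes) -> str:
    """Hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def is_unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value."""
    return body_digest(body) == reported.lower()

# Placeholder body for illustration only; not the real file contents.
sample = b"User-agent: *\nDisallow: /cgi-bin/\n"
digest = body_digest(sample)
```

Comparing a freshly fetched body against the reported value in this way would indicate whether the file changed since the scan, provided the assumption about the algorithm holds.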

Groups

*

Rule Path
Disallow /_*
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/backup*
Disallow /wp-content/themes/
Disallow /wp-login.php
Disallow /4-ways-to-tame-your-documents/
Disallow /teleseminar-details/
Disallow /teleseminar-download-details/
Disallow /confirm-teleseminar/
Disallow /paperless-document-organizer-thank-you/
Disallow /confirm-paperless-document-organizer/
Disallow /pdo-truecrypt/
Disallow /dpo-mysetup/
Disallow /pdo-spotlight/
Disallow /pdo-winsearch/
Disallow /dpo-maesx/
Disallow /pdo-spuhq/
Disallow /dpo-poqm/
Disallow /dpo-sparsebundle/
Disallow /pdo-using-evernote/
Disallow /hazel-webinar-thank-you/
Disallow /hazel-webinar-recording/
Disallow /dotto-evernote-bonus
Disallow /dotto-evernote-webinar-grab-bonuses/
Disallow /loves/*
Disallow /pdog-upgrade-green-gold/
Disallow /pdog-upgrade-green-platinum/
Disallow /pdog-upgrade-gold-platinum/
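
Several of the rules above use the `*` wildcard (`/_*`, `/wp-content/backup*`, `/loves/*`), which Python's standard `urllib.robotparser` does not interpret as a wildcard. The sketch below is one possible way to check a URL path against these rules by translating each pattern into a regular expression, treating `*` as "any sequence of characters" and a trailing `$` as an end anchor. The function names are hypothetical, and only a subset of the listed rules is included for brevity. Because this group contains no Allow rules, any match is treated as disallowed.

```python
import re

# Disallow paths copied from the "*" group above (subset for brevity).
DISALLOW = [
    "/_*",
    "/cgi-bin/",
    "/wp-admin/",
    "/wp-content/backup*",
    "/wp-login.php",
    "/dotto-evernote-bonus",
    "/loves/*",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with '.*' for each '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """True if any Disallow rule matches from the start of `path`."""
    # re.Pattern.match anchors at the start, giving prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/wp-admin/options.php")` and `is_disallowed("/_private")` are both true, while `is_disallowed("/wp-content/uploads/a.png")` is false. Note that `/dotto-evernote-bonus` has no trailing slash, so it also matches longer paths that begin with that string.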

Other Records

Field Value
sitemap http://www.documentsnap.com/sitemap.xml.gz

Comments

  • don’t search for files in these directories
  • For Google XML sitemaps