
Robots Exclusion Standard data for scanmydocument.com

Resource Scan

Scan Details

Site Domain scanmydocument.com
Base Domain scanmydocument.com
Scan Status Ok
Last Scan 2025-06-21T18:17:59+00:00
Next Scan 2025-06-28T18:17:59+00:00

Last Scan

Scanned 2025-06-21T18:17:59+00:00
URL https://scanmydocument.com/robots.txt
Redirect https://www.scanmydocument.com/robots.txt
Redirect Domain www.scanmydocument.com
Redirect Base scanmydocument.com
Domain IPs 199.34.228.47
Redirect IPs 199.34.228.47
Response IP 199.34.228.47
Found Yes
Hash cdfafd82441caecac0569b933e4bf97b4874f8c3e122a0c7198a1eb4c7fdc998
SimHash 0a44e88a4b92
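
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed. The following is only a sketch under the assumption that it is SHA-256 over the raw response body; the helper names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "cdfafd82441caecac0569b933e4bf97b4874f8c3e122a0c7198a1eb4c7fdc998"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the report hashes the raw bytes of robots.txt with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly fetched body against the reported hash."""
    return body_digest(body) == reported.lower()
```

If the assumption holds, passing the bytes of a freshly fetched robots.txt to `matches_report` would indicate whether the file has changed since the scan; a mismatch could equally mean the report uses a different algorithm or normalizes the content first.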

Groups

nerdybot

Rule Path
Disallow /

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /terms.html
Disallow /privacy.html

Other Records

Field Value
sitemap https://www.scanmydocument.com/sitemap.xml
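
To illustrate how the groups and the sitemap record above would be interpreted by a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the listed rules, not the original file (its exact formatting, ordering, and any comments are unknown), and `examplebot` is a hypothetical user agent standing in for any crawler matched by the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups and records listed in the report;
# this is an assumption, not a copy of the actual file.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /terms.html
Disallow: /privacy.html

Sitemap: https://www.scanmydocument.com/sitemap.xml
"""

BASE = "https://www.scanmydocument.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# nerdybot is blocked from the whole site.
print(parser.can_fetch("nerdybot", BASE + "/"))

# Any other agent falls back to the "*" group.
print(parser.can_fetch("examplebot", BASE + "/"))
print(parser.can_fetch("examplebot", BASE + "/ajax/load"))
print(parser.can_fetch("examplebot", BASE + "/terms.html"))

# Sitemap records are not tied to any group (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Note that `urllib.robotparser` uses simple prefix matching, so other crawlers may interpret edge cases differently.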