davidbergman.net
robots.txt

Robots Exclusion Standard data for davidbergman.net

Resource Scan

Scan Details

Site Domain davidbergman.net
Base Domain davidbergman.net
Scan Status Ok
Last Scan 2025-12-24T09:02:26+00:00
Next Scan 2026-01-07T09:02:26+00:00

Last Scan

Scanned 2025-12-24T09:02:26+00:00
URL https://davidbergman.net/robots.txt
Domain IPs 65.181.116.15
Response IP 65.181.116.15
Found Yes
Hash 7e86818e025f74eb71e82444dc43aecb85fcd1ad301a9a7ba39d859c492e2e23
SimHash a273740a8bd2
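
The report does not say which algorithm produced the Hash value. It is 64 hexadecimal characters long, which is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function names `sha256_hex` and `matches_reported` are hypothetical helpers for illustration, not part of any scanner's API.

```python
import hashlib

# Hash value as listed in the scan report above.
REPORTED_HASH = "7e86818e025f74eb71e82444dc43aecb85fcd1ad301a9a7ba39d859c492e2e23"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumption: the report hashes the unmodified response body with SHA-256.
    If the scanner normalizes the text first, this comparison will not match.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would mean either that the file has changed since the scan or that the hashing assumption is wrong; the two cases cannot be told apart from the report alone.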

Groups

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /errors/
Disallow /images/
Disallow /scripts/
Disallow /alicebraga/
Disallow /alicemarie/
Disallow /alicemariesreveries/
Disallow /amandabergman/
Disallow /platypuspark/
Disallow /teawithalicemarie/
Disallow /tourphotographer/
Disallow /tourphotographers/
Disallow /tourphotography/
Disallow /wonderlandpublishinggroup/

Comments

  • Prevent archive.org
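
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed rules; it is a reconstruction from this report, not the verbatim file, and the placement of the comment line is an assumption. `ExampleBot` is a made-up user agent used only to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Comments listed in the report.
# Assumption: exact whitespace, ordering, and comment placement may differ
# from the file actually served at https://davidbergman.net/robots.txt.
ROBOTS_TXT = """\
# Prevent archive.org
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /cgi-bin/
Disallow: /errors/
Disallow: /images/
Disallow: /scripts/
Disallow: /alicebraga/
Disallow: /alicemarie/
Disallow: /alicemariesreveries/
Disallow: /amandabergman/
Disallow: /platypuspark/
Disallow: /teawithalicemarie/
Disallow: /tourphotographer/
Disallow: /tourphotographers/
Disallow: /tourphotography/
Disallow: /wonderlandpublishinggroup/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://davidbergman.net"

# ia_archiver is blocked from the whole site.
print(parser.can_fetch("ia_archiver", base + "/"))               # False
# Any other crawler falls under the "*" group.
print(parser.can_fetch("ExampleBot", base + "/images/a.png"))    # False
print(parser.can_fetch("ExampleBot", base + "/index.html"))      # True
```

Note that `urllib.robotparser` matches rules by simple path prefix, so `/alicemarie/` does not cover `/alicemariesreveries/`; that is presumably why each directory is listed separately.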