anarcho-copy.org
robots.txt

Robots Exclusion Standard data for anarcho-copy.org

Resource Scan

Scan Details

Site Domain anarcho-copy.org
Base Domain anarcho-copy.org
Scan Status Ok
Last Scan2024-05-10T05:57:28+00:00
Next Scan 2024-06-09T05:57:28+00:00

Last Scan

Scanned2024-05-10T05:57:28+00:00
URL https://anarcho-copy.org/robots.txt
Domain IPs 104.21.24.148, 172.67.219.66, 2606:4700:3033::ac43:db42, 2606:4700:3034::6815:1894
Response IP 104.21.24.148
Found Yes
Hash 641cae021e53e6f60971592d2df8bb7a27fa1ce4cf38398e0a9eb33754447577
SimHash 600504652ad3

Groups

*

Rule Path
Disallow /archive.tar

Other Records

Field Value
sitemap https://anarcho-copy.org/sitemap.xml
sitemap https://anarcho-copy.org/sitemap.txt

Comments

  • Instant mirror:
  • wget -q -O - https://anarcho-copy.org/mirror.txt | wget -x -N -q -i -
  • Instant mirror (http only):
  • wget -q -O - http://anarcho-copy.org/mirror.txt | sed 's%https://%http://%g' | wget -x -N -q -i -