mail-archive.com
robots.txt

Robots Exclusion Standard data for mail-archive.com

Resource Scan

Scan Details

Site Domain mail-archive.com
Base Domain mail-archive.com
Scan Status Ok
Last Scan 2025-12-26T07:26:41+00:00
Next Scan 2026-01-25T07:26:41+00:00

Last Scan

Scanned 2025-12-26T07:26:41+00:00
URL https://mail-archive.com/robots.txt
Domain IPs 104.21.79.212, 172.67.148.206, 2606:4700:3030::ac43:94ce, 2606:4700:3037::6815:4fd4
Response IP 172.67.148.206
Found Yes
Hash 93358ef4ddd26304f067ad1908673a606f7597712aee2b5fff7374d7f5746784
SimHash 7bacdd134dd1
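
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is SHA-256 over the raw response body, a fetched copy of the file could be compared against the reported value roughly as follows. The function names are hypothetical, and no claim is made about how the scanner itself computes the hash.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "93358ef4ddd26304f067ad1908673a606f7597712aee2b5fff7374d7f5746784"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumption: the report hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed; the scanner may normalize line endings or hash something other than the raw body.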

Groups

*

Rule Path
Disallow /cardedeu%40thesaurus.net
Disallow /cardedeu%40thesaurus.net
Disallow /mailto.php
Disallow /localization-request.php
Disallow /logo-request.php
Disallow phishwatch%40lists.clean-mx.com
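
As a rough illustration of how a crawler would apply the rules listed above, the sketch below feeds them to Python's standard `urllib.robotparser`. The rule text is transcribed from this report rather than from the live file, the user-agent name `ExampleBot` is hypothetical, and the last rule is left out because, as shown, it has no leading slash and parsers may treat it differently.

```python
from urllib.robotparser import RobotFileParser

# Rules transcribed from the "*" group above; the live file may differ in
# ordering, whitespace, and comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cardedeu%40thesaurus.net
Disallow: /mailto.php
Disallow: /localization-request.php
Disallow: /logo-request.php
"""


def is_allowed(path: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if user_agent may fetch path under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for path in ("/index.html", "/mailto.php", "/cardedeu%40thesaurus.net/"):
        print(path, is_allowed(path))
```

Matching here is by path prefix, so everything under a disallowed path is excluded as well.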

Other Records

Field Value
sitemap https://www.mail-archive.com/sitemap_inc.xml
sitemap https://www.mail-archive.com/sitemap_full-2.xml
sitemap https://www.mail-archive.com/sitemap_full-1.xml
sitemap https://www.mail-archive.com/sitemap_full-0.xml
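
The sitemap records above can be pulled out of a robots.txt body with a few lines of code. This is a minimal sketch: `extract_sitemaps` is a hypothetical helper, the sample text is reconstructed from this report, and the field name is matched case-insensitively.

```python
def extract_sitemaps(robots_txt: str) -> list[str]:
    """Collect the values of all 'Sitemap:' records in a robots.txt body."""
    sitemaps = []
    for raw_line in robots_txt.splitlines():
        # Drop trailing comments; this also truncates URLs containing '#'.
        line = raw_line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap" and value.strip():
            sitemaps.append(value.strip())
    return sitemaps


# Sample reconstructed from the records listed in this report.
SAMPLE = """\
User-agent: *
Disallow: /mailto.php
Sitemap: https://www.mail-archive.com/sitemap_inc.xml
Sitemap: https://www.mail-archive.com/sitemap_full-2.xml
Sitemap: https://www.mail-archive.com/sitemap_full-1.xml
Sitemap: https://www.mail-archive.com/sitemap_full-0.xml
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```

Note that the sitemaps are served from www.mail-archive.com, while the scanned file was fetched from mail-archive.com.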

Comments

  • Based on the theory that spambots ignore this file anyway, we block them at the web server level. All robots are allowed in here.
  • Favor for the Catalan translation guy
  • Keep dynamic stuff away from search engines
  • Spam research lists confuse search engines