archlynk.com
robots.txt

Robots Exclusion Standard data for archlynk.com

Resource Scan

Scan Details

Site Domain archlynk.com
Base Domain archlynk.com
Scan Status Ok
Last Scan 2025-10-09T03:16:42+00:00
Next Scan 2025-10-16T03:16:42+00:00
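
The two timestamps above imply a scan interval, which can be checked with a short sketch. The report does not state the scanner's schedule, so the seven-day figure is only what these two values yield, and the variable names are illustrative.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details block (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2025-10-09T03:16:42+00:00")
next_scan = datetime.fromisoformat("2025-10-16T03:16:42+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)
```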

Last Scan

Scanned 2025-10-09T03:16:42+00:00
URL https://archlynk.com/robots.txt
Domain IPs 104.21.21.188, 172.67.199.207, 2606:4700:3032::6815:15bc, 2606:4700:3034::ac43:c7cf
Response IP 104.21.21.188
Found Yes
Hash ff931028fbd677de214344c49592ad49d4301bc1340f4cd019f4ef212891a618
SimHash 61721d727fb3
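
The Hash value above is 64 hexadecimal digits, which matches the length of a SHA-256 digest. The report does not name the algorithm, so SHA-256 is an assumption here. Under that assumption, a fetched robots.txt body could be fingerprinted as follows to detect changes between scans. The helper name `content_fingerprint` is hypothetical, and the sample body is a placeholder, not the actual file.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Placeholder body for illustration only; the real file content is not
# reproduced in this report, so this digest will not match the Hash field.
sample_body = b"User-agent: *\nDisallow: /vendor/\n"
fingerprint = content_fingerprint(sample_body)
print(len(fingerprint))
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all. A similarity hash such as the SimHash field serves a different purpose: estimating how much it changed.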

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cdn-cgi/l/email-protection

Other Records

Field Value
sitemap https://archlynk.com/sitemaps-1-sitemap.xml
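
As a rough illustration of how a crawler would apply the group and sitemap record above, the sketch below rebuilds an equivalent robots.txt from the listed rules and queries it with Python's standard `urllib.robotparser`. The reconstructed text is an approximation (the original file's exact layout and comments are not shown here), and the user agent name `ExampleBot` and the test paths are made up for the example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables; the exact
# formatting of the live file is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cdn-cgi/l/email-protection

Sitemap: https://archlynk.com/sitemaps-1-sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical paths: two that fall under Disallow rules, two that do not.
for path in ("/", "/about", "/vendor/autoload.php", "/.env"):
    allowed = parser.can_fetch("ExampleBot", "https://archlynk.com" + path)
    print(path, "allowed" if allowed else "disallowed")

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Because the only group targets `*`, every user agent gets the same answer. The rules are matched as path prefixes, so anything under `/vendor/` or `/cpresources/` is excluded while the rest of the site stays crawlable.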

Comments

  • robots.txt for https://archlynk.com/
  • Env: live
  • live - don't allow web crawlers to index cpresources/ or vendor/