alanardiff.com
robots.txt

Robots Exclusion Standard data for alanardiff.com

Resource Scan

Scan Details

Site Domain alanardiff.com
Base Domain alanardiff.com
Scan Status Ok
Last Scan2024-06-03T10:35:42+00:00
Next Scan 2024-06-10T10:35:42+00:00

Last Scan

Scanned2024-06-03T10:35:42+00:00
URL https://alanardiff.com/robots.txt
Domain IPs 104.21.2.169, 172.67.129.121, 2606:4700:3034::ac43:8179, 2606:4700:3037::6815:2a9
Response IP 104.21.2.169
Found Yes
Hash 5f21cbdfe00f803b09e069c0557953d34aaa218b79ea56403a5b544edf4ec14e
SimHash 214019763797

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://alanardiff.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://alanardiff.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/