intel471.com
robots.txt

Robots Exclusion Standard data for intel471.com

Resource Scan

Scan Details

Site Domain intel471.com
Base Domain intel471.com
Scan Status Ok
Last Scan2024-05-03T13:42:02+00:00
Next Scan 2024-05-10T13:42:02+00:00

Last Scan

Scanned2024-05-03T13:42:02+00:00
URL https://intel471.com/robots.txt
Domain IPs 104.26.14.158, 104.26.15.158, 172.67.71.97, 2606:4700:20::681a:e9e, 2606:4700:20::681a:f9e, 2606:4700:20::ac43:4761
Response IP 104.26.15.158
Found Yes
Hash 8e91a4b8249fe0e7767dc8c54d9c8c449e12bf33be98ba082498c40e9bf7b304
SimHash 43401d523593

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://intel471.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://intel471.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/