smatechnologies.com
robots.txt

Robots Exclusion Standard data for smatechnologies.com

Resource Scan

Scan Details

Site Domain smatechnologies.com
Base Domain smatechnologies.com
Scan Status Ok
Last Scan2025-06-09T13:13:26+00:00
Next Scan 2025-06-16T13:13:26+00:00

Last Scan

Scanned2025-06-09T13:13:26+00:00
URL https://smatechnologies.com/robots.txt
Domain IPs 104.26.6.122, 104.26.7.122, 172.67.68.109, 2606:4700:20::681a:67a, 2606:4700:20::681a:77a, 2606:4700:20::ac43:446d
Response IP 172.67.68.109
Found Yes
Hash f0ddb5db305f5ddbe8361754f5d2a39cfe8829f8064e46a5fee669a9b92ac1f4
SimHash 0f221c527f12

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /*?utm_source=
Disallow /*?utm_medium=
Disallow /*?utm_campaign=

Other Records

Field Value
sitemap https://smatechnologies.com/fr/sitemaps-4-sitemap.xml
sitemap https://smatechnologies.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://smatechnologies.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/