insightlsat.com
robots.txt

Robots Exclusion Standard data for insightlsat.com

Resource Scan

Scan Details

Site Domain insightlsat.com
Base Domain insightlsat.com
Scan Status Ok
Last Scan2025-12-24T20:16:16+00:00
Next Scan 2026-01-23T20:16:16+00:00

Last Scan

Scanned2025-12-24T20:16:16+00:00
URL https://insightlsat.com/robots.txt
Domain IPs 13.35.37.121, 13.35.37.28, 13.35.37.56, 13.35.37.70
Response IP 13.35.37.56
Found Yes
Hash f33fa619fb975806c1361db14d8638c7b2300492626318738ea6ce2c527cef3c
SimHash 29448e008130

Groups

*

Rule Path
Disallow

yandexbot

Rule Path
Disallow /

*

Rule Path
Allow /$
Allow /about$
Allow /contact$
Disallow /_next/static/
Disallow /_next/
Disallow /fonts/
Disallow /icons/
Disallow /*.js$
Disallow /*.css$
Disallow /*.svg$
Disallow /*.png$
Disallow /*.ttf$
Disallow /*.otf$
Disallow /*.eot$
Disallow /*.woff$
Disallow /*.woff2$
Disallow /*.map$

Other Records

Field Value
sitemap https://insightlsat.com/sitemap.xml

Comments

  • Block Yandex completely
  • Rules for all other crawlers