sofa.gr
robots.txt

Robots Exclusion Standard data for sofa.gr

Resource Scan

Scan Details

Site Domain sofa.gr
Base Domain sofa.gr
Scan Status Ok
Last Scan2025-12-31T10:42:12+00:00
Next Scan 2026-01-30T10:42:12+00:00

Last Scan

Scanned2025-12-31T10:42:12+00:00
URL https://sofa.gr/robots.txt
Domain IPs 93.174.123.71
Response IP 93.174.123.71
Found Yes
Hash 53f2be48077a88db7fb5b8f1ed91b6af948b7f1ea9040b5aa46b11fddf18691a
SimHash 171c2a80a516

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /
Disallow /*tag%3D
Disallow /*search%3D
Disallow /*manufacturer_id%3D
Disallow /*sort%3D
Disallow /*order%3D
Disallow /*limit%3D
Disallow /*filter_name%3D
Disallow /*filter_sub_category%3D
Disallow /*filter_description%3D
Disallow /*?fa
Disallow /?*ft

Other Records

Field Value
sitemap https://www.sofa.gr/sitemap.xml

Comments

  • robots.txt for sofa.gr
  • Last updated: October 2025
  • Purpose: Standard search + AI crawler configuration
  • Sitemap
  • LLMs (AI Crawlers Instructions)
  • Optional (recommended) — if you want to explicitly allow AI bots:
  • Disallow sensitive admin paths
  • === Lightning code start
  • === Lightning code end
  • End of file

Warnings

  • `llms` is not a known field.
  • `llms-full` is not a known field.