almacasa.ro
robots.txt

Robots Exclusion Standard data for almacasa.ro

Resource Scan

Scan Details

Site Domain almacasa.ro
Base Domain almacasa.ro
Scan Status Ok
Last Scan 2025-10-05T13:44:23+00:00
Next Scan 2025-10-12T13:44:23+00:00

Last Scan

Scanned 2025-10-05T13:44:23+00:00
URL https://almacasa.ro/robots.txt
Redirect https://www.almacasa.ro/static/robots.txt
Redirect Domain www.almacasa.ro
Redirect Base almacasa.ro
Domain IPs 195.160.162.79
Redirect IPs 195.160.162.79
Response IP 195.160.162.79
Found Yes
Hash 3417732ea9a93279caec44b0abe720294c76847603184acb709587fa85630f1c
SimHash ae3cf912abf8

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /q/
Disallow /tag/
Disallow /product_id%3D/
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*
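
Three of the paths above use the `*` wildcard, which is matched differently from a plain prefix. The following is a minimal sketch of how such patterns are commonly interpreted, assuming `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `rule_to_regex` and `is_disallowed` and the sample URLs are hypothetical and do not come from the scanned site.

```python
import re
from typing import Iterable


def rule_to_regex(path_pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex matched from the start.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the URL; everything else is literal.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' in place of each '*'.
    literal_parts = [re.escape(part) for part in path_pattern.split("*")]
    return re.compile(".*".join(literal_parts) + ("$" if anchored else ""))


def is_disallowed(path_and_query: str, patterns: Iterable[str]) -> bool:
    """Return True if any pattern matches the path (plus query) from its start."""
    return any(rule_to_regex(p).match(path_and_query) for p in patterns)


# A subset of the Disallow paths listed for the '*' group.
PATTERNS = ["/invoice", "/*?f=*", "/*?fb_xd_fragment", "/*?qty=*"]

# Hypothetical request paths, for illustration only.
print(is_disallowed("/canapele?f=123", PATTERNS))       # True
print(is_disallowed("/cart?qty=2", PATTERNS))           # True
print(is_disallowed("/canapele", PATTERNS))             # False
print(is_disallowed("/canapele?page=2&f=1", PATTERNS))  # False
```

The last case suggests a limitation of these rules: `/*?f=*` only matches when `f` is the first query parameter, because the pattern requires the literal `?f=`.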

Other Records

Field Value
crawl-delay 5

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.almacasa.ro/sitemap.xml
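
The groups and records above can be checked with Python's standard `urllib.robotparser`. This is a sketch only: the robots.txt text below is a reconstruction from this report, not the original file, whose exact layout and 2 invalid lines are not shown. The wildcard paths are left out because `urllib.robotparser` does plain prefix matching, and `examplebot` is a hypothetical crawler name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report (an assumption, not the original file).
# Wildcard rules are omitted: urllib.robotparser only matches path prefixes.
ROBOTS_TXT = """\
User-agent: *
Disallow: /feed/
Disallow: /html/
Disallow: /scripts/
Disallow: /invoice
Disallow: /q/
Disallow: /tag/
Crawl-delay: 5

User-agent: ahrefsbot
Disallow: /

User-agent: baiduspider
Disallow: /

Sitemap: https://www.almacasa.ro/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# 'examplebot' is a hypothetical agent that falls under the '*' group.
print(parser.can_fetch("examplebot", "https://www.almacasa.ro/feed/"))  # False
print(parser.can_fetch("examplebot", "https://www.almacasa.ro/"))       # True

# Agent names are compared case-insensitively, so AhrefsBot hits its own group.
print(parser.can_fetch("AhrefsBot", "https://www.almacasa.ro/"))        # False

print(parser.crawl_delay("examplebot"))  # 5
print(parser.site_maps())                # ['https://www.almacasa.ro/sitemap.xml']
```

`site_maps()` requires Python 3.8 or later. The `Disallow /` groups for etaospider, npbot, and synthesio would be added to the reconstruction in the same way as the two shown.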

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • Disallow some bots we do not care about

Warnings

  • 2 invalid lines.