integralcasa.ro
robots.txt

Robots Exclusion Standard data for integralcasa.ro

Resource Scan

Scan Details

Site Domain integralcasa.ro
Base Domain integralcasa.ro
Scan Status Ok
Last Scan2025-10-03T07:52:13+00:00
Next Scan 2025-10-10T07:52:13+00:00

Last Scan

Scanned2025-10-03T07:52:13+00:00
URL https://integralcasa.ro/robots.txt
Redirect https://www.integralcasa.ro/static/robots.txt
Redirect Domain www.integralcasa.ro
Redirect Base integralcasa.ro
Domain IPs 195.160.162.147
Redirect IPs 195.160.162.147
Response IP 195.160.162.147
Found Yes
Hash 593b6df7c847c69dd689fe3914523eefb2f9b559d63ce88ba2b7ae9b631422f3
SimHash 3b1ed136a9d8

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*

Other Records

Field Value
crawl-delay 5

ahrefsbot
amazonbot
baiduspider
becomebot
blexbot
ccbot
etaospider
exabot
mj12bot
npbot
shopwiki
smtbot
sogouspider
sogou web spider
stanford
stanford comp sci
synthesio
yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.integralcasa.ro/sitemap.xml

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • Disallow bad bots

Warnings

  • 1 invalid line.