thehome.ro
robots.txt

Robots Exclusion Standard data for thehome.ro

Resource Scan

Scan Details

Site Domain thehome.ro
Base Domain thehome.ro
Scan Status Ok
Last Scan2024-09-14T11:16:58+00:00
Next Scan 2024-09-21T11:16:58+00:00

Last Scan

Scanned2024-09-14T11:16:58+00:00
URL https://thehome.ro/robots.txt
Redirect https://www.thehome.ro/static/robots.txt
Redirect Domain www.thehome.ro
Redirect Base thehome.ro
Domain IPs 136.243.114.236
Redirect IPs 136.243.114.236
Response IP 136.243.114.236
Found Yes
Hash 1bcc447c7ac3c9903210cb21293797ada4b7a15e337984e7c9bc28ec02c3f392
SimHash 2e2cdf58a9e9

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*
Disallow *fbcapi
Disallow *?page=
Disallow *?s=did
Disallow *?s=price
Disallow *?s=newest
Disallow *?s=pa
Disallow *?sort=
Disallow *?promo=
Disallow *?type=
Disallow *product_review
Disallow *?s=pd
Disallow *?action=
Disallow *?s=na
Disallow *?s=name
Disallow *?pg=
Disallow *?s=popularity
Disallow *?s=price
Disallow *?stock=on
Disallow *?product_id=
Disallow *?c=
Disallow *?view_type=
Disallow *?s=rd
Disallow *?new=
Disallow *?price
Disallow /*2pau
Disallow /*2ptt
Disallow /*2ptu
Disallow /*2prp
Disallow /*2pdlst

Other Records

Field Value
crawl-delay 5

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thehome.ro/sitemap.xml

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • Other irrelevant pages to block from crawling
  • 2Performant
  • Disallow some bots we do not care about

Warnings

  • 2 invalid lines.