
Robots Exclusion Standard data for depurtat.ro

Resource Scan

Scan Details

Site Domain depurtat.ro
Base Domain depurtat.ro
Scan Status Ok
Last Scan 2024-09-11T20:18:08+00:00
Next Scan 2024-09-18T20:18:08+00:00

Last Scan

Scanned 2024-09-11T20:18:08+00:00
URL https://depurtat.ro/robots.txt
Redirect https://www.depurtat.ro/static/robots.txt
Redirect Domain www.depurtat.ro
Redirect Base depurtat.ro
Domain IPs 136.243.114.231
Redirect IPs 136.243.114.231
Response IP 136.243.114.231
Found Yes
Hash 2b15106d386c50efbacd0cb1001df51ed3e19c62ae47e8fa8b2680d23ae6f1dd
SimHash 3b1cd936a9d8

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*
Disallow /*2pau
Disallow /*2ptt
Disallow /*2ptu
Disallow /*2prp
Disallow /*2pdlst

Other Records

Field Value
crawl-delay 5
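
Several of the paths above use the `*` wildcard, which matches any run of characters, while plain paths match as prefixes. The following is a minimal sketch of how a crawler might evaluate these rules. It assumes the common wildcard semantics (`*` for any characters, a trailing `$` as an end anchor); the helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical and not part of the scan data. Because the group contains only Disallow rules, no Allow/Disallow precedence logic is needed here.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/feed/", "/html/", "/scripts/", "/invoice",
    "/*?f=*", "/*?fb_xd_fragment", "/*?qty=*",
    "/*2pau", "/*2ptt", "/*2ptu", "/*2prp", "/*2pdlst",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumed semantics: '*' matches any run of characters and a
    trailing '$' anchors the end of the path. Everything else is
    matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (with query)."""
    # re.match anchors at the start, which gives prefix matching.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for sample in ["/feed/rss", "/product?f=red", "/shoes", "/page?x=1&2ptt=abc"]:
        print(sample, "->", "blocked" if is_disallowed(sample) else "allowed")
```

Note that a plain path such as `/invoice` is a prefix match in this sketch, so a hypothetical `/invoices` path would also be blocked.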

ahrefsbot
amazonbot
baiduspider
becomebot
ccbot
etaospider
exabot
mj12bot
npbot
shopwiki
smtbot
sogouspider
sogou web spider
stanford
stanford comp sci
synthesio
yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.depurtat.ro/sitemap.xml
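
The two groups, the crawl delay, and the sitemap record can be checked with Python's standard `urllib.robotparser`. The sketch below is illustrative only: the `ROBOTS_TXT` text is a partial reconstruction from the fields in this report, not the actual file served by the site, and it lists only three of the blocked user agents. The wildcard paths are left out because `urllib.robotparser` does plain prefix matching and does not interpret `*` inside paths. The `site_maps()` call needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report above (an assumption, not the
# real file): plain-prefix rules only, and a subset of the blocked bots.
ROBOTS_TXT = """\
User-agent: *
Disallow: /feed/
Disallow: /invoice
Crawl-delay: 5

User-agent: ahrefsbot
User-agent: mj12bot
User-agent: yandexbot
Disallow: /

Sitemap: https://www.depurtat.ro/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A bot named in the second group is blocked from the whole site.
ahrefs_allowed = parser.can_fetch("AhrefsBot", "https://www.depurtat.ro/")

# Any other bot falls back to the "*" group.
other_allowed = parser.can_fetch("Googlebot", "https://www.depurtat.ro/shoes")
feed_allowed = parser.can_fetch("Googlebot", "https://www.depurtat.ro/feed/rss")

# The crawl delay belongs to the "*" group only.
default_delay = parser.crawl_delay("Googlebot")

# Sitemap records are independent of any group.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print("AhrefsBot may fetch /:", ahrefs_allowed)
    print("Googlebot may fetch /shoes:", other_allowed)
    print("Googlebot may fetch /feed/rss:", feed_allowed)
    print("Crawl delay for Googlebot:", default_delay)
    print("Sitemaps:", sitemaps)
```

The `/shoes` path and the `Googlebot` agent are hypothetical inputs used only to show the fallback to the `*` group.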

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • 2Performant
  • Disallow bad bots

Warnings

  • 1 invalid line.