Robots Exclusion Standard data for pensulemachiaj.ro

Resource Scan

Scan Details

Site Domain pensulemachiaj.ro
Base Domain pensulemachiaj.ro
Scan Status Ok
Last Scan 2024-06-13T03:40:05+00:00
Next Scan 2024-06-20T03:40:05+00:00

Last Scan

Scanned 2024-06-13T03:40:05+00:00
URL https://pensulemachiaj.ro/robots.txt
Redirect https://www.pensulemachiaj.ro/static/robots.txt
Redirect Domain www.pensulemachiaj.ro
Redirect Base pensulemachiaj.ro
Domain IPs 104.26.4.243, 104.26.5.243, 172.67.69.101, 2606:4700:20::681a:4f3, 2606:4700:20::681a:5f3, 2606:4700:20::ac43:4565
Redirect IPs 104.26.4.243, 104.26.5.243, 172.67.69.101, 2606:4700:20::681a:4f3, 2606:4700:20::681a:5f3, 2606:4700:20::ac43:4565
Response IP 104.26.4.243
Found Yes
Hash 843059c353e9dcd7aa61864ab42dc2156577074bc21200e00db0d766410fde1d
SimHash aa0cb91aa9f8

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*
Disallow /*recommend?*
Disallow /*2pau
Disallow /*2ptt
Disallow /*2ptu
Disallow /*2prp
Disallow /*2pdlst

Other Records

Field Value
crawl-delay 5
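
Several of the paths in this group use the `*` wildcard. Under the common reading (as in RFC 9309), `*` stands for any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The following is a minimal Python sketch under that assumption; the helper names `rule_matches` and `is_disallowed` and the sample paths are hypothetical illustrations and do not come from the scan.

```python
import re

# Disallow paths listed for the "*" group in this scan.
DISALLOW = [
    "/feed/", "/html/", "/scripts/", "/invoice",
    "/*?f=*", "/*?fb_xd_fragment", "/*?qty=*", "/*recommend?*",
    "/*2pau", "/*2ptt", "/*2ptu", "/*2prp", "/*2pdlst",
]

def rule_matches(rule: str, path: str) -> bool:
    """Return True if a robots.txt path rule matches the start of `path`."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    if anchored:
        pattern += "$"
    # re.match anchors at the start only, which gives prefix semantics.
    return re.match(pattern, path) is not None

def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule of the '*' group matches `path`."""
    return any(rule_matches(rule, path) for rule in DISALLOW)

if __name__ == "__main__":
    # Hypothetical paths, used only to exercise the rules.
    for sample in ["/invoice/123", "/pensule?qty=2", "/pensule-machiaj"]:
        print(sample, is_disallowed(sample))
```

The sketch ignores Allow rules and longest-match precedence because this group contains only Disallow entries.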

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pensulemachiaj.ro/sitemap.xml
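
The per-bot groups, the crawl-delay and the sitemap record can be checked with Python's standard `urllib.robotparser`. The sketch below is an approximation: the `ROBOTS_TXT` text is reconstructed from the fields in this report and is not the original file (the 2 invalid lines are unknown), the `*` group is cut down to one plain rule because the standard-library parser does simple prefix matching rather than wildcard matching, and `SomeBot` is a hypothetical user agent.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan fields; an assumption, not the original file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /feed/
Crawl-delay: 5

User-agent: ahrefsbot
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: etaospider
Disallow: /

User-agent: npbot
Disallow: /

User-agent: synthesio
Disallow: /

Sitemap: https://www.pensulemachiaj.ro/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.pensulemachiaj.ro"

# Bots with their own "Disallow: /" group are blocked everywhere.
blocked = parser.can_fetch("AhrefsBot", BASE + "/")

# Any other agent falls back to the "*" group.
home_ok = parser.can_fetch("SomeBot", BASE + "/")
feed_ok = parser.can_fetch("SomeBot", BASE + "/feed/")

delay = parser.crawl_delay("SomeBot")   # seconds between requests
sitemaps = parser.site_maps()           # available in Python 3.8+

if __name__ == "__main__":
    print(blocked, home_ok, feed_ok, delay, sitemaps)
```

User-agent matching in `urllib.robotparser` is case-insensitive, so `AhrefsBot` resolves to the `ahrefsbot` group shown in this report.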

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • Disallow UTM BigBear params
  • Disallow some bots we do not care about

Warnings

  • 2 invalid lines.