tomvalk.ro
robots.txt

Robots Exclusion Standard data for tomvalk.ro

Resource Scan

Scan Details

Site Domain tomvalk.ro
Base Domain tomvalk.ro
Scan Status Ok
Last Scan2025-09-07T14:52:17+00:00
Next Scan 2025-09-14T14:52:17+00:00

Last Scan

Scanned2025-09-07T14:52:17+00:00
URL https://tomvalk.ro/robots.txt
Redirect https://www.tomvalk.ro/static/robots.txt
Redirect Domain www.tomvalk.ro
Redirect Base tomvalk.ro
Domain IPs 195.160.162.77
Redirect IPs 195.160.162.77
Response IP 195.160.162.77
Found Yes
Hash cb750720012d75adbbe404d87ae0e4fd94b1d458f4a12e1990b4225ce95482c5
SimHash 2b1ed136a9d8

Groups

*

Rule Path
Disallow /feed/
Disallow /html/
Disallow /scripts/
Disallow /invoice
Disallow /*?f=*
Disallow /*?fb_xd_fragment
Disallow /*?qty=*

Other Records

Field Value
crawl-delay 5

ahrefsbot
amazonbot
baiduspider
becomebot
blexbot
ccbot
etaospider
exabot
mj12bot
npbot
shopwiki
smtbot
sogouspider
sogou web spider
stanford
stanford comp sci
synthesio
yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tomvalk.ro/sitemap.xml

Comments

  • No need to index this stuff
  • Avoid duplicate content as it may do more harm than good
  • Disallow bad bots

Warnings

  • 1 invalid line.