de.yehwang.com
robots.txt

Robots Exclusion Standard data for de.yehwang.com

Resource Scan

Scan Details

Site Domain de.yehwang.com
Base Domain yehwang.com
Scan Status Ok
Last Scan 2026-01-29T03:55:28+00:00
Next Scan 2026-02-28T03:55:28+00:00

Last Scan

Scanned 2026-01-29T03:55:28+00:00
URL https://de.yehwang.com/robots.txt
Domain IPs 104.18.4.81, 104.18.5.81, 2606:4700::6812:451, 2606:4700::6812:551
Response IP 104.18.4.81
Found Yes
Hash 1c6d9787866c9c1a8278122e7d5f750ef43a510a06ea5dc94bd1e6d1c7cbf971
SimHash a218d9818fc3
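
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch of how such a fingerprint could be computed, assuming SHA-256 over the raw response bytes (the function name and the sample body are hypothetical, and the sample is not expected to reproduce the hash above):

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file content.
sample = b"User-agent: *\nDisallow: /api/*\n"
print(robots_fingerprint(sample))
```

Comparing this digest between two scans is a cheap way to tell whether the file changed at all. The SimHash field presumably serves the complementary purpose of measuring how much it changed.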

Groups

*

Rule Path
Disallow /index/*
Disallow /us/*
Disallow /eu/*
Disallow /web/*
Disallow /livewire/*
Disallow /api/*
Disallow /index.php?route=*
Disallow *?product_id=*
Disallow *?vlog_id=*
Disallow /%7B%221%22

seekportbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /
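
The groups above can be read as a lookup: a crawler uses the group named after it if one exists, and falls back to the `*` group otherwise. The sketch below illustrates that reading with wildcard-aware prefix matching. It is a simplification under stated assumptions: user agents are compared as exact lowercase tokens, percent-encoding is not normalized, and the test paths are hypothetical. Python's standard `urllib.robotparser` does not expand `*` inside paths, which is why a small hand-written matcher is used here.

```python
import re

# Rules copied from the groups listed above.
GROUPS = {
    "*": [
        "/index/*", "/us/*", "/eu/*", "/web/*", "/livewire/*", "/api/*",
        "/index.php?route=*", "*?product_id=*", "*?vlog_id=*", "/%7B%221%22",
    ],
    "seekportbot": ["/"],
    "dataforseobot": ["/"],
}


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path rule into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = rule.endswith("$")
    core = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(user_agent: str, path: str) -> bool:
    """Return True if any Disallow rule of the applicable group matches path."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Hypothetical paths, chosen only to exercise the rules.
print(is_disallowed("Googlebot", "/api/v1/items"))
print(is_disallowed("Googlebot", "/products/ring"))
print(is_disallowed("SeekportBot", "/products/ring"))
```

Because the file contains no Allow rules, a single matching Disallow is enough to block a path. With Allow rules present, the longest matching rule would have to win instead.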

Other Records

Field Value
sitemap https://de.yehwang.com/sitemap.xml
sitemap https://de.yehwang.com/sitemap2.xml
sitemap https://de.yehwang.com/sitemap-image/sitemap-image-index.xml
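
The sitemap records can be recovered from a robots.txt body with Python's standard `urllib.robotparser` (`site_maps()` requires Python 3.8 or later). This is a sketch only: the sample text is reconstructed from the records above and is not the file's verbatim content, and `list_sitemaps` is a hypothetical helper name.

```python
from typing import List
from urllib.robotparser import RobotFileParser


def list_sitemaps(robots_text: str) -> List[str]:
    """Return the Sitemap URLs declared in a robots.txt body, in file order."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


# Reconstructed sample, not the verbatim file.
SAMPLE = """\
User-agent: *
Disallow: /api/*
Sitemap: https://de.yehwang.com/sitemap.xml
Sitemap: https://de.yehwang.com/sitemap2.xml
Sitemap: https://de.yehwang.com/sitemap-image/sitemap-image-index.xml
"""

for url in list_sitemaps(SAMPLE):
    print(url)
```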

Comments

  • <URL:http://www.robotstxt.org/wc/exclusion.html#robotstxt>
  • Format is:
  • User-agent: <name of spider>
  • Disallow: <nothing> | <path>
  • -----------------------------------------------------------------------------
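
The format described in the comments (a `User-agent` line followed by `Disallow` lines, where an empty `Disallow` blocks nothing) can be illustrated by rendering one group back into text. This is a minimal sketch; `render_group` and `examplebot` are hypothetical names used only for illustration.

```python
from typing import List


def render_group(user_agent: str, disallow_paths: List[str]) -> str:
    """Render one robots.txt group in the User-agent / Disallow format."""
    lines = [f"User-agent: {user_agent}"]
    if not disallow_paths:
        # An empty Disallow value means nothing is blocked for this agent.
        lines.append("Disallow:")
    lines.extend(f"Disallow: {path}" for path in disallow_paths)
    return "\n".join(lines)


print(render_group("seekportbot", ["/"]))
print()
print(render_group("examplebot", []))
```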