thinhphatcomputer.com
robots.txt

Robots Exclusion Standard data for thinhphatcomputer.com

Resource Scan

Scan Details

Site Domain thinhphatcomputer.com
Base Domain thinhphatcomputer.com
Scan Status Ok
Last Scan 2026-02-12T00:09:08+00:00
Next Scan 2026-02-19T00:09:08+00:00

Last Scan

Scanned 2026-02-12T00:09:08+00:00
URL https://thinhphatcomputer.com/robots.txt
Domain IPs 103.124.95.161
Response IP 103.124.95.161
Found Yes
Hash 61508c8b077cdfcfb4ab00c708b5961e3b82c74857786510dd006fadef53963b
SimHash 098a0a79eaa2

Groups

*

Rule Path
Allow /$
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/
Allow /wp-includes/js/
Allow /*.css$
Allow /*.js$
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /cart/
Disallow /checkout/
Disallow /my-account/
Disallow /add-to-cart/
Disallow /*add-to-cart%3D*
Disallow /wishlist/
Disallow /compare/
Disallow /search/
Disallow /*?s=*
Disallow /*?orderby=*
Disallow /*?filter_*
Disallow /*?rating=*
Disallow /*?utm_*
Disallow /*?gclid=*
Disallow /*?fbclid=*
Disallow /cgi-bin/
Disallow /tmp/
Disallow /feed/
Disallow /*/feed/
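
The group above mixes Allow and Disallow rules that use the `*` wildcard and the `$` end anchor, so which rule wins for a given path is not obvious from the listing alone. The following is a minimal sketch of the longest-match precedence described in RFC 9309, written by hand because Python's standard `urllib.robotparser` does not interpret wildcards. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative assumptions, only a subset of the rules is included, and real crawlers may differ in details such as percent-encoding normalization.

```python
import re

# Subset of the "*" group rules listed above, as (directive, pattern) pairs.
RULES = [
    ("allow", "/$"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*.css$"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/cart/"),
    ("disallow", "/*?s=*"),
    ("disallow", "/*/feed/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            # The longest pattern wins; on a tie, the Allow rule wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ["/", "/wp-admin/admin-ajax.php", "/wp-admin/style.css",
                 "/?s=laptop", "/blog/feed/"]:
        print(path, is_allowed(path))
```

Under this sketch, `/wp-admin/admin-ajax.php` stays crawlable because its Allow pattern is longer than `/wp-admin/`, while `/wp-admin/style.css` is blocked because `/wp-admin/` (10 characters) is longer than `/*.css$` (7 characters).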

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://thinhphatcomputer.com/sitemap.xml
sitemap https://thinhphatcomputer.com/sitemap_index.xml
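
As a rough illustration of how a client might read the per-bot crawl-delay values and the sitemap records shown above, the sketch below feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The fragment `ROBOTS_FRAGMENT` is an assumption rebuilt from this report, not the file's actual text or ordering, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the relevant records; the real file may be laid out differently.
ROBOTS_FRAGMENT = """\
User-agent: ahrefsbot
Crawl-delay: 5

User-agent: semrushbot
Crawl-delay: 5

User-agent: mj12bot
Crawl-delay: 10

Sitemap: https://thinhphatcomputer.com/sitemap.xml
Sitemap: https://thinhphatcomputer.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

if __name__ == "__main__":
    # User-agent matching is case-insensitive; bots without a group get None.
    for bot in ["AhrefsBot", "SemrushBot", "MJ12bot", "Googlebot"]:
        print(bot, parser.crawl_delay(bot))
    print(parser.site_maps())
```

A polite crawler would sleep for the returned number of seconds between requests and treat `None` as "no delay requested".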

Comments

  • robots.txt for https://thinhphatcomputer.com/
  • ALLOW crawling of important content
  • BLOCK technical areas / thin content / duplicate content
  • Crawl rate for some "heavy" bots (Google ignores Crawl-delay; Bing/Yandex honor it)
  • Sitemap declaration