thethaothientruong.vn
robots.txt

Robots Exclusion Standard data for thethaothientruong.vn

Resource Scan

Scan Details

Site Domain thethaothientruong.vn
Base Domain thethaothientruong.vn
Scan Status Ok
Last Scan2025-10-07T05:52:16+00:00
Next Scan 2025-11-06T05:52:16+00:00

Last Scan

Scanned2025-10-07T05:52:16+00:00
URL https://thethaothientruong.vn/robots.txt
Domain IPs 103.200.23.236
Response IP 103.200.23.236
Found Yes
Hash 4a741a170c549a09648ea556cf91756243f6c3bc0bb5db05a88507c9575e8cb7
SimHash 7f501d22ce9b

Groups

*

Rule Path
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /tag/
Disallow /author/
Disallow /*?add-to-cart=
Disallow /*?orderby=
Disallow /*?filter_
Disallow /*?srslid=
Disallow /*?utm_source=
Disallow /*?utm_medium=
Disallow /*?utm_campaign=
Disallow /*?replytocom=
Disallow /*?remove_item=
Disallow /feed/
Disallow /*/feed/
Disallow /*/feed$
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://thethaothientruong.vn/sitemap_index.xml

Comments

  • Chặn thư mục không cần SEO
  • Chặn các trang tìm kiếm nội bộ, tag, author
  • Chặn các tham số rác & link động
  • Chặn RSS & feed
  • Cho phép ajax (bắt buộc để web hoạt động bình thường)
  • Sitemap

Warnings

  • 4 invalid lines.