inantrantung.com
robots.txt

Robots Exclusion Standard data for inantrantung.com

Resource Scan

Scan Details

Site Domain inantrantung.com
Base Domain inantrantung.com
Scan Status Ok
Last Scan 2025-09-22T07:52:29+00:00
Next Scan 2025-10-22T07:52:29+00:00

Last Scan

Scanned 2025-09-22T07:52:29+00:00
URL https://inantrantung.com/robots.txt
Domain IPs 104.21.85.194, 172.67.209.143, 2606:4700:3033::6815:55c2, 2606:4700:3033::ac43:d18f
Response IP 104.21.85.194
Found Yes
Hash 0c88521b110e1ba60c8b46b4686d17a26dceab8b1aea0d8cc4b5afe722042a2a
SimHash 49000832ecbb

Groups

*

Rule Path
Disallow */page/
Disallow /feed/
Disallow /*/feed/
Disallow /*?view=
Disallow /*?no_cache=
Disallow /*?amp=
Allow /wp-content/uploads/
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
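
The Allow and Disallow rules above overlap in places (for example /wp-admin/ and /wp-admin/admin-ajax.php), so which one applies depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and the common `*` and `$` wildcard extensions. The function names `pattern_to_regex` and `is_allowed` are hypothetical and are not part of the scan data. How a given crawler treats the first rule, which does not begin with `/`, may differ from this sketch.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "*/page/"),
    ("disallow", "/feed/"),
    ("disallow", "/*/feed/"),
    ("disallow", "/*?view="),
    ("disallow", "/*?no_cache="),
    ("disallow", "/*?amp="),
    ("allow", "/wp-content/uploads/"),
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: pattern length is used as the measure of specificity,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # A longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/blog/page/2/", "/wp-admin/admin-ajax.php",
              "/wp-content/uploads/wc-logs/x.log", "/product/abc"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, /wp-admin/admin-ajax.php stays crawlable even though /wp-admin/ is disallowed, while the three WooCommerce upload directories are blocked despite the broader Allow on /wp-content/uploads/.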

Other Records

Field Value
sitemap https://www.inantrantung.com/sitemap_index.xml

Comments

  • Block pagination (avoid thin content)
  • Block feeds
  • Block junk parameters
  • Allow sitemap