truongtotnhat.vn
robots.txt
Robots Exclusion Standard data for truongtotnhat.vn
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | truongtotnhat.vn |
| Base Domain | truongtotnhat.vn |
| Scan Status | Ok |
| Last Scan | 2025-12-09T08:13:39+00:00 |
| Next Scan | 2026-01-08T08:13:39+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-12-09T08:13:39+00:00 |
| URL | https://truongtotnhat.vn/robots.txt |
| Domain IPs | 104.21.92.121, 172.67.193.1, 2606:4700:3033::ac43:c101, 2606:4700:3037::6815:5c79 |
| Response IP | 172.67.193.1 |
| Found | Yes |
| Hash | d526016ff45b3e338b0510d09a440309aa8e525395c403f5463ba05a1386c5cf |
| SimHash | 663419d1c523 |
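The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch, assuming the digest is taken over the raw response body, change detection between scans could look like this (`body_digest` and `has_changed` are hypothetical helper names, not part of the scanner):

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hex: str) -> bool:
    """Compare a freshly fetched body against the digest from an earlier scan."""
    return body_digest(body) != previous_hex.lower()
```

A caller would pass the bytes fetched from the URL listed above together with the Hash recorded at the previous scan. Whether the result matches the scanner's own value depends on the assumption about the algorithm and on any normalization the scanner applies.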
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path | Comment |
|---|---|---|
| Disallow | /wp-admin/ | - |
| Disallow | /wp-includes/ | - |
| Disallow | /cdn-cgi/ | - |
| Disallow | /cdn-cgi/* | - |
| Disallow | /cgi-bin/ | - |
| Disallow | /wp-content/themes/flatsome/assets/js | - |
| Disallow | /wp-content/themes/flatsome/assets/js/* | - |
| Disallow | /wp-content/litespeed/js/* | - |
| Disallow | /san-pham/ | Block the product directory |
| Disallow | /*?doing_wp_cron | - |
| Disallow | /*/ceylonthemes.com | - |
| Disallow | /readme.html | - |
| Disallow | /license.txt | - |
| Disallow | /index.html | - |
| Disallow | /feed/$ | - |
| Disallow | /feed | - |
| Disallow | /atom | - |
| Disallow | /tag/ | - |
| Disallow | /search/ | - |
| Disallow | /search_user | - |
| Disallow | /gio-hang | - |
| Disallow | /thanh-toan | - |
| Disallow | /tai-khoan/* | - |
| Disallow | /user/ | - |
| Disallow | /quantri | - |
| Disallow | /quantri?* | - |
| Disallow | %26p%3D | - |
| Disallow | *?sp_atk | - |
| Disallow | *utm_source | - |
| Disallow | /search?keyword=.com | - |
| Disallow | /search?keyword=.tv | - |
| Disallow | /search?keyword=.xyz | - |
| Disallow | /search?keyword=%C3%83%E2%80%9E%C3%86%E2%80%99%C3%83%C2%A2%C3%A2%E2%82%AC%C5%A1%C3%82%C2%AC%C3%83%C2%A2%C3%A2%E2%80%9A%C2%ACCom | - |
| Disallow | /search?keyword=%C3%84%E2%80%9A%C3%A2%E2%82%AC%C5%A1%C3%83%E2%80%9A%C3%82%C2%B7COM | - |
| Disallow | /search?category=.com | - |
| Disallow | /search?category=.tv | - |
| Disallow | /search?category=.xyz | - |
| Disallow | /search?category=*%C3%83%E2%80%9E%C3%86%E2%80%99%C3%83%C2%A2%C3%A2%E2%82%AC%C5%A1%C3%82%C2%AC%C3%83%C2%A2%C3%A2%E2%80%9A%C2%ACCom | - |
| Disallow | /search?category=*%C3%84%E2%80%9A%C3%A2%E2%82%AC%C5%A1%C3%83%E2%80%9A%C3%82%C2%B7COM | - |
| Disallow | /thumbs/* | - |
| Disallow | /wp-content/plugins/table-of-contents-plus/front.min.js | - |
| Disallow | /wp-includes/js/jquery/jquery.min.js | - |
| Disallow | /wp-content/themes/flatsome/inc/extensions/flatsome-instant-page/flatsome-instant-page.js | - |
| Disallow | /wp-content/plugins/wpforms-lite/assets/lib/jquery.validate.min.js | - |
| Disallow | */trackback | - |
gptbot
ccbot
anthropic-ai
omgilibot
diffbot
imagesiftbot
perplexitybot
cohere-ai
chatgpt-user
bytespider
magpie-crawler
petalbot
claude-web
ai2bot
ai2bot-dolma
alphaai
friendlycrawler
iaskspider/2.0
icc-crawler
isscyberriskcrawler
img2dataset
kangaroo bot
oai-searchbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
wget
httrack
linkwalker
emailcollector
exabot
| Rule | Path |
|---|---|
| Disallow | / |
| Allow | /wp-admin/admin-ajax.php |
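To see how the groups above interact, the following is a minimal sketch of rule evaluation, assuming the longest-match semantics described in RFC 9309: the matching pattern with the most characters wins, a tie goes to Allow, and a path with no matching rule is allowed. The helper names, the trimmed rule lists, and the sample paths are illustrative, and real crawlers may interpret these rules differently.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Longest matching pattern wins; ties go to 'allow'; no match means allowed."""
    best_key, best_kind = None, "allow"
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt matching does.
        if pattern_to_regex(pattern).match(path):
            key = (len(pattern), kind == "allow")
            if best_key is None or key > best_key:
                best_key, best_kind = key, kind
    return best_kind == "allow"


# A small subset of the '*' groups listed above (both groups merged).
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/tag/"),
    ("disallow", "/feed/$"),
    ("disallow", "*utm_source"),
]

# The group shared by gptbot, ccbot, anthropic-ai and the other listed agents.
AI_BOT_RULES = [
    ("disallow", "/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

if __name__ == "__main__":
    for path in ("/", "/wp-admin/admin-ajax.php"):
        print("AI bot group", path, is_allowed(AI_BOT_RULES, path))
    for path in ("/gioi-thieu", "/tag/abc", "/bai-viet?utm_source=x"):
        print("* group", path, is_allowed(STAR_RULES, path))
```

Under these assumptions the listed AI and scraper agents are blocked everywhere except `/wp-admin/admin-ajax.php`, because that Allow pattern is longer than `Disallow: /`. A custom matcher is used here because Python's standard `urllib.robotparser` applies the first matching rule in file order and does not interpret `*` or `$`, so it would probably report a different result for this group.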
Other Records
| Field | Value |
|---|---|
| sitemap | https://truongtotnhat.vn/sitemap_index.xml |
Warnings
- 6 invalid lines.
- `content-signal` is not a known field.
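Warnings of this kind typically come from a line-level check of the file. The sketch below shows one way such a check could work; the set of known fields is an assumption (the report does not say which fields the scanner recognizes), and the `Content-Signal` value in the usage example is illustrative rather than taken from the scanned file.

```python
# Assumed set of recognized field names; the scanner's actual list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def classify_line(line: str) -> str:
    """Classify one robots.txt line as 'blank', 'invalid', 'known' or 'unknown-field'."""
    # Drop any trailing comment, then surrounding whitespace.
    stripped = line.split("#", 1)[0].strip()
    if not stripped:
        return "blank"
    # A record needs a 'field: value' shape; anything else is counted as invalid.
    if ":" not in stripped:
        return "invalid"
    field = stripped.split(":", 1)[0].strip().lower()
    return "known" if field in KNOWN_FIELDS else "unknown-field"


if __name__ == "__main__":
    sample = [
        "User-agent: *",
        "Disallow: /tag/",
        "Content-Signal: search=yes",  # illustrative value only
        "some stray text",
    ]
    for line in sample:
        print(classify_line(line), "|", line)
```

Lines classified as `invalid` or `unknown-field` are normally skipped by crawlers, so they do not change how the valid rules in the groups are applied.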
Comments