truongtotnhat.vn
robots.txt

Robots Exclusion Standard data for truongtotnhat.vn

Resource Scan

Scan Details

Site Domain truongtotnhat.vn
Base Domain truongtotnhat.vn
Scan Status Ok
Last Scan 2025-12-09T08:13:39+00:00
Next Scan 2026-01-08T08:13:39+00:00

Last Scan

Scanned 2025-12-09T08:13:39+00:00
URL https://truongtotnhat.vn/robots.txt
Domain IPs 104.21.92.121, 172.67.193.1, 2606:4700:3033::ac43:c101, 2606:4700:3037::6815:5c79
Response IP 172.67.193.1
Found Yes
Hash d526016ff45b3e338b0510d09a440309aa8e525395c403f5463ba05a1386c5cf
SimHash 663419d1c523

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path Comment
Disallow /wp-admin/ -
Disallow /wp-includes/ -
Disallow /cdn-cgi/ -
Disallow /cdn-cgi/* -
Disallow /cgi-bin/ -
Disallow /wp-content/themes/flatsome/assets/js -
Disallow /wp-content/themes/flatsome/assets/js/* -
Disallow /wp-content/litespeed/js/* -
Disallow /san-pham/ Block the product directory
Disallow /*?doing_wp_cron -
Disallow /*/ceylonthemes.com -
Disallow /readme.html -
Disallow /license.txt -
Disallow /index.html -
Disallow /feed/$ -
Disallow /feed -
Disallow /atom -
Disallow /tag/ -
Disallow /search/ -
Disallow /search_user -
Disallow /gio-hang -
Disallow /thanh-toan -
Disallow /tai-khoan/* -
Disallow /user/ -
Disallow /quantri -
Disallow /quantri?* -
Disallow %26p%3D -
Disallow *?sp_atk -
Disallow *utm_source -
Disallow /search?keyword=.com -
Disallow /search?keyword=.tv -
Disallow /search?keyword=.xyz -
Disallow /search?keyword=%C3%83%E2%80%9E%C3%86%E2%80%99%C3%83%C2%A2%C3%A2%E2%82%AC%C5%A1%C3%82%C2%AC%C3%83%C2%A2%C3%A2%E2%80%9A%C2%ACCom -
Disallow /search?keyword=%C3%84%E2%80%9A%C3%A2%E2%82%AC%C5%A1%C3%83%E2%80%9A%C3%82%C2%B7COM -
Disallow /search?category=.com -
Disallow /search?category=.tv -
Disallow /search?category=.xyz -
Disallow /search?category=*%C3%83%E2%80%9E%C3%86%E2%80%99%C3%83%C2%A2%C3%A2%E2%82%AC%C5%A1%C3%82%C2%AC%C3%83%C2%A2%C3%A2%E2%80%9A%C2%ACCom -
Disallow /search?category=*%C3%84%E2%80%9A%C3%A2%E2%82%AC%C5%A1%C3%83%E2%80%9A%C3%82%C2%B7COM -
Disallow /thumbs/* -
Disallow /wp-content/plugins/table-of-contents-plus/front.min.js -
Disallow /wp-includes/js/jquery/jquery.min.js -
Disallow /wp-content/themes/flatsome/inc/extensions/flatsome-instant-page/flatsome-instant-page.js -
Disallow /wp-content/plugins/wpforms-lite/assets/lib/jquery.validate.min.js -
Disallow */trackback -
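Several paths in this group use the `*` wildcard and the `$` end anchor. The following is a minimal sketch of how such patterns are commonly interpreted, assuming that `*` matches any run of characters, that a trailing `$` anchors the end of the URL path, and that everything else is a prefix match. The helper names `robots_pattern_to_regex` and `path_matches` are hypothetical and are not part of the scanner.

```python
import re


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    All other characters are matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece and join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def path_matches(pattern: str, path: str) -> bool:
    """Return True if the URL path (including any query string) matches."""
    return robots_pattern_to_regex(pattern).match(path) is not None


# Patterns taken from the group above; the sample paths are made up.
print(path_matches("/feed/$", "/feed/"))                        # True
print(path_matches("/feed/$", "/feed/rss"))                     # False
print(path_matches("/*?doing_wp_cron", "/page/?doing_wp_cron=1"))  # True
print(path_matches("*utm_source", "/bai-viet?utm_source=fb"))   # True
```

Under this reading, `/feed/$` blocks only the exact path `/feed/`, while the separate `/feed` rule blocks every path that starts with `/feed`.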

gptbot
ccbot
anthropic-ai
omgilibot
diffbot
imagesiftbot
perplexitybot
cohere-ai
chatgpt-user
bytespider
magpie-crawler
petalbot
claude-web
ai2bot
ai2bot-dolma
alphaai
friendlycrawler
iaskspider/2.0
icc-crawler
isscyberriskcrawler
img2dataset
kangaroo bot
oai-searchbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
wget
httrack
linkwalker
emailcollector
exabot

Rule Path
Disallow /
Allow /wp-admin/admin-ajax.php
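This last group pairs `Disallow /` with a narrower `Allow` rule. The sketch below shows how the two might be resolved, assuming the longest-match precedence described in RFC 9309: the longest matching pattern wins, and an `Allow` wins a tie. The names `AI_BOT_RULES` and `is_allowed` are hypothetical, and parsers that apply rules in file order instead could give a different answer.

```python
import re

# Rules of the group above as (directive, path pattern) pairs.
AI_BOT_RULES = [
    ("disallow", "/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def _matches(pattern: str, path: str) -> bool:
    """Prefix match with '*' as a wildcard and a trailing '$' as end anchor."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(piece) for piece in core.split("*"))
    return re.match(regex + ("$" if anchored else ""), path) is not None


def is_allowed(rules, path: str) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if not _matches(pattern, path):
            continue
        is_allow = directive == "allow"
        if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
            best_len, allowed = len(pattern), is_allow
    return allowed


print(is_allowed(AI_BOT_RULES, "/wp-admin/admin-ajax.php"))  # True
print(is_allowed(AI_BOT_RULES, "/gioi-thieu/"))              # False
```

With this precedence, the listed crawlers are shut out of the whole site except for the single `admin-ajax.php` endpoint. The path `/gioi-thieu/` is a made-up example.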

Other Records

Field Value
sitemap https://truongtotnhat.vn/sitemap_index.xml

Comments

  • As a condition of accessing this website, you agree to abide by the following content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding use.
  • (b) If a content-signal = no, you may not collect content for the corresponding use.
  • (c) If the website operator does not include a content signal for a corresponding use, the website operator neither grants nor restricts permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning hyperlinks and short excerpts from your website's contents). Search does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval augmented generation, grounding, or other real-time taking of content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • Basic crawling rules
  • Core files protection
  • Feed and navigation
  • E-commerce pages
  • Search parameters
  • Search keyword restrictions
  • Media and assets
  • Trackback
  • AI and Bot restrictions
  • Sitemap
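The content-signal comments define three uses (`search`, `ai-input`, `ai-train`) and three outcomes: yes, no, or not stated. The report does not show the site's actual `content-signal` line, so the sketch below works on a hypothetical value and assumes a comma-separated `name=yes|no` syntax. The function names `parse_content_signal` and `permission` are illustrative only.

```python
KNOWN_SIGNALS = ("search", "ai-input", "ai-train")


def parse_content_signal(value: str) -> dict:
    """Parse a value such as 'search=yes, ai-train=no' into a dict of booleans.

    Unknown signal names and settings other than yes/no are ignored.
    """
    signals = {}
    for item in value.split(","):
        name, sep, setting = item.strip().partition("=")
        name, setting = name.strip().lower(), setting.strip().lower()
        if sep and name in KNOWN_SIGNALS and setting in ("yes", "no"):
            signals[name] = setting == "yes"
    return signals


def permission(signals: dict, use: str) -> str:
    """Map a use to the three outcomes (a), (b), (c) described in the comments."""
    if use not in signals:
        return "unspecified"  # case (c): neither granted nor restricted
    return "granted" if signals[use] else "reserved"


# Hypothetical value; the real line is not shown in this report.
example = parse_content_signal("search=yes, ai-train=no")
print(permission(example, "search"))    # granted
print(permission(example, "ai-train"))  # reserved
print(permission(example, "ai-input"))  # unspecified
```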

Warnings

  • 6 invalid lines.
  • `content-signal` is not a known field.
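A warning like the second one can arise when a validator recognises only a fixed set of field names. The scanner's actual validation logic is not documented here, so the following is only a sketch under assumptions: the `KNOWN_FIELDS` set, the `find_unknown_fields` function, and the sample text (including its `Content-Signal` value) are all hypothetical.

```python
# Assumed set of recognised fields; the real scanner may accept others.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def find_unknown_fields(robots_txt: str):
    """Return (line_number, field) for each line whose field is not recognised."""
    problems = []
    for number, raw in enumerate(robots_txt.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        field, sep, _ = line.partition(":")
        field = field.strip().lower()
        if not sep or field not in KNOWN_FIELDS:
            problems.append((number, field))
    return problems


# Hypothetical excerpt, not the site's real file.
SAMPLE = """\
# Basic crawling rules
User-agent: *
Content-Signal: search=yes
Allow: /
Sitemap: https://truongtotnhat.vn/sitemap_index.xml
"""

print(find_unknown_fields(SAMPLE))  # [(3, 'content-signal')]
```

A validator built this way would flag every `content-signal` line as invalid even though crawlers that understand the field may still honour it.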