thienthaitruyen2.com
robots.txt

Robots Exclusion Standard data for thienthaitruyen2.com

Resource Scan

Scan Details

Site Domain thienthaitruyen2.com
Base Domain thienthaitruyen2.com
Scan Status Ok
Last Scan2026-02-15T23:39:07+00:00
Next Scan 2026-03-17T23:39:07+00:00

Last Scan

Scanned2026-02-15T23:39:07+00:00
URL https://thienthaitruyen2.com/robots.txt
Redirect https://thienthaitruyen5.com/robots.txt
Redirect Domain thienthaitruyen5.com
Redirect Base thienthaitruyen5.com
Domain IPs 104.21.10.127, 172.67.190.40, 2606:4700:3031::6815:a7f, 2606:4700:3036::ac43:be28
Redirect IPs 104.21.53.27, 172.67.208.13, 2606:4700:3033::ac43:d00d, 2606:4700:3035::6815:351b
Response IP 172.67.208.13
Found Yes
Hash 8e1247043afc21dbbe89907c0ac3b0b8de73191d8101435d39e148c3cfe4b009
SimHash 0a14d2d880a2

Groups

*

Rule Path
Allow /
Disallow /dang-nhap
Disallow /dang-ky
Disallow /tim-kiem-nang-cao?genres=*
Disallow /tim-kiem-nang-cao?name=*
Disallow /tim-kiem-nang-cao?page=*
Disallow /truyen-tranh/*/chuong*

dmcaagent

Rule Path
Disallow /

copyrightcrawler

Rule Path
Disallow /

antipiracybot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

goo-crawler

Rule Path
Disallow /

rakutenbot

Rule Path
Disallow /

teoma

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

musobot

Rule Path
Disallow /

link-busterbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thienthaitruyen5.com/sitemap.xml

Warnings

  • 4 invalid lines.