thienthaitruyen4.com
robots.txt

Robots Exclusion Standard data for thienthaitruyen4.com

Resource Scan

Scan Details

Site Domain thienthaitruyen4.com
Base Domain thienthaitruyen4.com
Scan Status Ok
Last Scan2026-02-15T22:52:57+00:00
Next Scan 2026-03-17T22:52:57+00:00

Last Scan

Scanned2026-02-15T22:52:57+00:00
URL https://thienthaitruyen4.com/robots.txt
Redirect https://thienthaitruyen5.com/robots.txt
Redirect Domain thienthaitruyen5.com
Redirect Base thienthaitruyen5.com
Domain IPs 104.21.6.55, 172.67.154.245, 2606:4700:3030::6815:637, 2606:4700:3037::ac43:9af5
Redirect IPs 104.21.53.27, 172.67.208.13, 2606:4700:3033::ac43:d00d, 2606:4700:3035::6815:351b
Response IP 172.67.208.13
Found Yes
Hash 8e1247043afc21dbbe89907c0ac3b0b8de73191d8101435d39e148c3cfe4b009
SimHash 0a14d2d880a2

Groups

*

Rule Path
Allow /
Disallow /dang-nhap
Disallow /dang-ky
Disallow /tim-kiem-nang-cao?genres=*
Disallow /tim-kiem-nang-cao?name=*
Disallow /tim-kiem-nang-cao?page=*
Disallow /truyen-tranh/*/chuong*

dmcaagent

Rule Path
Disallow /

copyrightcrawler

Rule Path
Disallow /

antipiracybot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

goo-crawler

Rule Path
Disallow /

rakutenbot

Rule Path
Disallow /

teoma

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

musobot

Rule Path
Disallow /

link-busterbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thienthaitruyen5.com/sitemap.xml

Warnings

  • 4 invalid lines.