thanthitv.com
robots.txt

Robots Exclusion Standard data for thanthitv.com

Resource Scan

Scan Details

Site Domain thanthitv.com
Base Domain thanthitv.com
Scan Status Ok
Last Scan2024-10-04T13:59:17+00:00
Next Scan 2024-10-11T13:59:17+00:00

Last Scan

Scanned2024-10-04T13:59:17+00:00
URL https://thanthitv.com/robots.txt
Redirect https://www.thanthitv.com/robots.txt
Redirect Domain www.thanthitv.com
Redirect Base thanthitv.com
Domain IPs 13.234.207.199
Redirect IPs 108.156.133.17, 108.156.133.31, 108.156.133.54, 108.156.133.6
Response IP 108.156.133.6
Found Yes
Hash 8e4a55ef96f7f0bf538d17d033954673e887ab881bd24a01808dfac5a1f2165a
SimHash c85a1e4bddd3

Groups

*

Rule Path
Allow /
Disallow /admin/*
Disallow /search/*
Disallow /search?*
Disallow /xhr/*
Disallow /preview/story-*
Disallow /amp/preview/story-*
Disallow /alfoo
Disallow /sildoo
Disallow /dutas
Disallow /video-only-for-doctors-72491
Allow /xhr/getNewsMixin*
Allow /content/servlet/RDESController?*
Disallow /advance-search
Disallow /forgotPassword
Disallow /register
Disallow /subscription
Disallow /login
Disallow /user
Disallow /reset-password
Disallow /verify-email
Disallow /app-lite

Other Records

Field Value
sitemap https://www.thanthitv.com/sitemap/sitemap-index.xml
sitemap https://www.thanthitv.com/news-sitemap-daily.xml
sitemap https://www.thanthitv.com/feeds.xml
sitemap https://www.thanthitv.com/google_feeds.xml
sitemap https://www.thanthitv.com/sitemap-daily.xml
sitemap https://www.thanthitv.com/sitemap/sitemap-home.xml

Comments

  • robots.txt for https://www.thanthitv.com/