dailythanthi.com
robots.txt

Robots Exclusion Standard data for dailythanthi.com

Resource Scan

Scan Details

Site Domain dailythanthi.com
Base Domain dailythanthi.com
Scan Status Ok
Last Scan2024-11-14T08:44:29+00:00
Next Scan 2024-11-21T08:44:29+00:00

Last Scan

Scanned2024-11-14T08:44:29+00:00
URL https://dailythanthi.com/robots.txt
Redirect https://www.dailythanthi.com/robots.txt
Redirect Domain www.dailythanthi.com
Redirect Base dailythanthi.com
Domain IPs 65.1.90.145
Redirect IPs 23.52.171.58, 23.59.168.153, 2600:1413:b000:1c::17d1:2ed2, 2600:1413:b000:1c::17d1:2ee0
Response IP 184.28.229.200
Found Yes
Hash 131ed4ecc079c1b2156af128873ca530f5f4712fed23e4f7e7e37b5190113066
SimHash 881a56036d03

Groups

*

Rule Path
Allow /
Disallow /admin/*
Disallow /search/*
Disallow /search?*
Disallow /xhr/*
Disallow /preview/story-*
Disallow /amp/preview/story-*
Disallow /alfoo
Disallow /sildoo
Disallow /dutas
Disallow /metsmall
Disallow /advance-search?*
Disallow /forgotPassword
Disallow /register
Disallow /subscription
Disallow /login?*
Disallow /user?*
Disallow /forgotPassword?*
Disallow /reset-password?*
Disallow /verify-email?*
Disallow /app-lite?*
Disallow /app-lite
Disallow /advance-search
Disallow /search
Disallow /h-social-login/*
Disallow /tags/*/page-*
Disallow /author
Disallow /h-pdf-viewer?*
Disallow /user-*
Disallow /web-stories/%5B0-9%5D*
Allow /xhr/getNewsMixin*
Allow /content/servlet/RDESController?*

Other Records

Field Value
sitemap https://www.dailythanthi.com/google_feeds.xml
sitemap https://www.dailythanthi.com/sitemap-daily.xml
sitemap https://www.dailythanthi.com/feeds.xml
sitemap https://www.dailythanthi.com/sitemap/sitemap-home.xml
sitemap https://www.dailythanthi.com/sitemap/sitemap-index.xml
sitemap https://www.dailythanthi.com/sitemap/photo-story-sitemap.xml

Comments

  • robots.txt for https://www.dailythanthi.com/