widgets.hindustantimes.com
robots.txt

Robots Exclusion Standard data for widgets.hindustantimes.com

Resource Scan

Scan Details

Site Domain widgets.hindustantimes.com
Base Domain hindustantimes.com
Scan Status Ok
Last Scan 2024-11-05T09:24:45+00:00
Next Scan 2024-11-12T09:24:45+00:00

Last Scan

Scanned 2024-11-05T09:24:45+00:00
URL https://widgets.hindustantimes.com/robots.txt
Domain IPs 23.52.171.120, 23.52.171.145, 2600:1413:1::6011:4848, 2600:1413:1::7d38:db30
Response IP 23.45.207.173
Found Yes
Hash 956deda4d48534eb814df120e3d4d19643b967587ff98906e3979a540e48bbf0
SimHash c34059634973

Groups

*

Rule Path
Allow /
Disallow /what-now/card-details/
Disallow /*/url
Disallow /*/imageURL
Disallow /fragment/
Disallow /Fragment/
Disallow /Error/
Disallow /error/
Disallow /intfeeds/
Disallow /Images/HTEditImages
Disallow /images/HTPopups/
Disallow /Images/Popup/
Disallow /images/Popup/
Disallow /homenew
Disallow /dummytestpage/*
Disallow /brand-stories/international/*
Disallow /origin-pre-prod/*
Disallow /sponsored-stories/*
Disallow /brand-stories/*
Disallow /brand-post/*
Disallow /static-content/10s/us-election.html
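
The rules above can be read as a longest-match table: under the Robots Exclusion Protocol (RFC 9309), the matching rule with the longest path wins, `*` matches any run of characters, and paths are case-sensitive. The following is a minimal sketch of that evaluation in Python, not the scanner's own logic. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the rules listed above is included for brevity.

```python
import re

# Subset of the "*" group listed above, as (rule, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/what-now/card-details/"),
    ("disallow", "/*/url"),
    ("disallow", "/fragment/"),
    ("disallow", "/homenew"),
    ("disallow", "/brand-stories/*"),
    ("disallow", "/static-content/10s/us-election.html"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors
    the pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on a tie, Allow wins.
    A path matching no rule is allowed by default.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

With this sketch, `is_allowed("/news/india")` is `True` because only `Allow /` matches, while `is_allowed("/foo/url")` is `False` because `/*/url` is the longer match. Because matching is case-sensitive, `is_allowed("/Homenew")` is `True` even though `/homenew` is disallowed, which is presumably why the file lists both `/fragment/` and `/Fragment/`.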

Other Records

Field Value
sitemap https://www.hindustantimes.com/sitemap/section.xml
sitemap https://www.hindustantimes.com/sitemap/news.xml
sitemap https://www.hindustantimes.com/sitemap/index.xml